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GERANYL DIPHOSPHATE SYNTHASE LARGE SUBUNIT, AND 
METHODS OF USE 

Government Rights 

This invention was funded by the United States Department of Energy grant 
5 number DE-FG03-96ER20212. The government has certain rights in the invention. 

Related Applications 
The present application claims benefit of priority from copending, 
international patent application PCT/US98/21772, filed on October 15, 1998, which 
claims benefit of priority from United States patent application serial number 
10 08/951,924, filed on October 16, 1997 (which issued as U.S. Patent Serial 
Number 5,876,964), each of which are incorporated herein by reference. 

Field of the Invention 
The present invention relates to nucleic acid sequences which code for 
geranyl diphosphate synthase large subunit, such as geranyl diphosphate synthase 
15 large subunit from Mentha piperita, and to vectors containing the sequences, host 
cells containing the sequences and methods of producing recombinant geranyl 
diphosphate synthase large subunit and its mutants. 

Background of the Invention 
Geranyl diphosphate synthase (GPP synthase) is one of a family of enzymes 
20 called prenyitransferases that catalyze C5 elongation reactions to form the linear 
(acyclic) precursors of the various terpenoid families. GPP synthase catalyzes the 
condensation of dimethylaliyl diphosphate (DMAPP) and isopentenyl diphosphate 
(IPP) to form geranyl diphosphate (GPP) which is the immediate, Cjo acyclic 
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precursor of the monoterpenes (Wise, M.L. and Croteau, R., in Cane, D.E., ed., 
"Comprehensive Natural Products Chemistry: Isoprenoids, Vol. 2" , Elsevier 
Science, Oxford, 1999. ps. 97-154) (FIG. 1). Famesyl diphosphate synthase (FPP 
synthase), a related prenyltransferase, utilizes GPP and IPP as substrates to form 
5 famesyl diphosphate (FPP), which is the immediate, C15 precursor of the 
sesquiterpenes (FIG. 1). Another prenyltransferase, geranylgeranyl diphosphate 
synthase (GGPP synthase), catalyzes the condensation of famesyl diphosphate (or 
DMA??) and IPP to form geranylgeranyl diphosphate (GGPP) which is the 
immediate C20 precursor of the diterpene family (FIG. 1). Other types of 

10 prenyltransferases can utilize FPP, GGPP and IPP as substrates to form very long 
chain molecules, such as natural rubber. Poulter CD. and RilUng, H.C., Accts. 
Chem. Res. 11: 307-313 (1978); Scolnik, P.A. and Hartley, G., Plant Mol. Biol Rep. 
14: 305, 307 (1996); Ogura, K. and Koyama, T., Chem. Rev. 98: 1263-1276 (1998). 
The basic reaction mechanism for all of these prenyltransferases is the same, 

15 and consists of three steps (see FIG. 2 in which the reaction catalyzed by geranyl 
diphosphate synthase is presented as illustrative of the general reaction mechanism). 
With reference to FIGURE 2, in the first step an allylic diphosphate ester (2a) is 
ionized to the stable carbonium ion (2b). The carbonium ion then attacks the double 
bond of isopentenyi diphosphate (2c) to yield another carbonium ion (2d). In the 

20 final step of the cycle, a proton is eliminated fi-om the newly formed carbonium ion 
(2d) to form a terpenoid containing a new allylic double bond (2e). In the reaction 
catalyzed by GPP synthase, the allylic diphosphate ester is dimethyl allyl 
diphosphate (FIG. 1 and FIG. 2). In the reactions catalyzed by FPP synthase and 
GGPP synthase the allylic diphosphate ester is geranyl diphosphate and famesyl 

25 diphosphate, respectively (FIG. 1). 

Unlike FPP synthase and GGPP synthase, which produce GPP as an 
intermediate and which are nearly ubiquitous (Ogura, K. and Koyama, T., in Ogura, 
K. and Sankawa, U., eds., "Dynamic Aspects of Natural Products Chemistry" 
KodanshaTHarwood Academic Publishers, Tokyo, pp. 1-23, 1997), geranyl 

30 diphosphate synthase is largely restricted to plant species that produce abundant 
quantities of monoterpenes. Because both FPP synthase and GGPP synthase produce 
only negligible levels of GPP as a free intermediate on route to FPP and GGPP 
((Ogura, K. and Koyama, T., in Ogura, K. and Sankawa, U., eds., "Dynamic Aspects 
of Natural Products Chemistry" Kodansha/Harwood Academic Publishers, Tokyo, 

35 pp. 1-23, 1997)), it is geranyl diphosphate synthase that provides the crucial link 
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between primary metabolism and monoterpene biosynthesis and that serves as the 
essential driver of monoterpene biosynthesis (Wise, M.L. and Croteau, R., supra). 

Any attempt, therefore, to exploit recombinant methods to increase the yield 
of monoterpene-producing (essential oil) species, or to genetically engineer the 
monoterpene biosynthetic pathway into any non-producing species {e.g., field crops, 
fruit-bearing plant species and animals) requires access to a geranyl diphosphate 
synthase gene or cDNA clone. Co-expression of geranyl diphosphate synthase along 
with a selected monoterpene synthase, such as (-)-limonene synthase (Colby et al., 
J. Biol. Chem. 268:23016-23024, 1993), and any subsequent pathway en2ymes, 
should allow production of the corresponding monoterpene product(s) from simple 
carbon substrates, such as glucose, in any living organism. 

Monoterpenes are utilized as flavoring agents in food products, and as scents 
in perftimes (Arctander, S., in Perfume and Flavor Materials of Natural Origin, 
Arctander Publications, Elizabeth, New Jersey; Bedoukian, P.Z. in Perfumery and 
Flavoring Materials, 4th edition. Allured Publications, Wheaton, Illinois, 1995; 
Allured, S., in Flavor and Fragrance Materials, Allured Publications, Wheaton, 
Illinois, 1997. Monoterpenes are also used as intermediates in various industrial 
processes. Dawson, F.A., in The Amazing Terpenes, Naval Stores Rev., 
March/April, 6-12, 1994. Monoterpenes are also unplicated in the natural defense 
systems of plants against pests and pathogens. Francke, W. in Muller, P.M. and 
Lamparsky, D., eds.. Perfumes: Art, Science and Technology, Elsevier Applied 
Science, NY, NY, 61-99, 1991; Harbome, J.B., in Harbome, J.B. and Tomas- 
Barberan, F.A., eds.. Ecological Chemistry and Biochemistry of Plant Terpenoids, 
Clarendon Press, Oxford, 399-426, 1991; Gershenzon, J and Croteau, R in Rosenthal, 
G.A. and Berenbaum, M.R., eds.. Herbivores: Their Interactions with Secondary 
Plant Metabolites, Academic Press, San Diego, 168-220, 1991. 

There is also substantial evidence that monoterpenes are effective in the 
prevention and treatment of cancer (Elson, C.E. and Yu, S.G., J. Nutr. 124: 607-614, 
1994.). Thus, for example, limonene, perrilyl alcohol and geranioi have each been 
shown to have chemotherapeutic activity against a very broad range of mammalian 
cancers (see, for example, (1) limonene, Elegbede et al., Carcinogenesis 5:661-665, 
1984; Elson et al., Carcinogenesis 9:331-332, 1988; Maltzman et al.. Carcinogenesis 
10:781-785, 1989; Wattenberg, L.W. and Coccia, J.B., Carcinogenesis 12:115-117, 
1991; Wattenberg, L.W. and Coccia, J.B., Carcinogenesis 12:115-117, 1991; Haag 
etal.. Cancer Res. 52:4021-4026, 1992; Croweil, P.L. and Gould, M.N., CRC Crit. 
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Rev. Oncogenesis 5:1-22, 1994; (2) periliyi alcohol. Mills et al., Cancer Res. 
55:979-983, 1995; Haag, J.D. and Gould, M.N., Cancer Chemother. Pharmacol. 
34:477-483, 1994; Stark etal. Cancer Lett. 96:15-21, 1995 and (3) geraniol, Shoff 
etal.. Cancer Res. 51:37-42, 1991; Yu etal., J. Nutr. 125:2763-2767, 1995; Burke 
5 etal.,Z/pzi& 32:151-156, 1997.). 

Cancer cells can be modified to produce therapeutic amounts of a 
monoterpene having anti-cancer properties by targeting the cognate monoterpene 
synthase protein to cancer cells, or by introducing a monoterpene synthase gene into 
cancer cells. This approach to cancer therapy is complicated, however, by the fact 
10 that the natural distribution of geranyl diphosphate synthase is largely restricted to 
plant species that produce abundant quantities of monoterpenes. Thus, animal cells 
do not naturally produce the monoterpene precursor geranyl diphosphate. 
Consequently, the genetic manipulation of cancer cells to produce endogenous 
monoterpenes having anti-cancer properties requires the introduction of a gene 
15 encoding geranyl diphosphate synthase, together with a gene encoding a 
monoterpene synthase that produces a monoterpene having anti-cancer properties. 
Similarly, if the protein targeting approach is utilized, both geranyl diphosphate 
synthase protein and monoterpene synthase protein must be targeted to cancer cells. 

Standard protein targeting techniques can be used to introduce geranyl 
20 diphosphate synthase along with a monoterpene synthase, such as limonene synthase 
(Colby et al., J. Biol. Chem. 268:23016-23024, 1993), into animal cells with specific 
targeting to tumors. See, e.g., Wearley, L.L., Critical Reviews in Therapeutic Drug 
Carrier Systems, 8(4): 331-394, 1991; Sheldon, K et al., Proc. Natl Acad. Sci. USA., 
92(6): 2056-2060, 1995. In addition, standard gene therapy techniques can be used 
25 to target a GPP synthase gene and a monoterpene synthase gene to cancerous cells 
for endogenous synthesis of monoterpenes having anti-cancer properties. For 
reviews of gene targeting technology see; Mahato R.I et al.. Pharmaceutical 
Research 14(7): 853-859, 1997; Rosenthal, F.M. and Mertelsmann, R., Onkologie 
20(1): 26-34, 1997; Buckel, P., Trends in Pharmacological Sciences 17(12): 
30 450-456, 1996; Roth, J.A. and Cristiano, R.J., J. Nat'l Cancer Inst. 89(1): 21-39, 
1997; Ledley, F.D, Pharmaceutical Research 13(11): 1595-1614, 1996. 

To date, extracts containing geranyl diphosphate synthase activity have been 
isolated from several plant sources, including grape (Clastre et al.. Plant Physiol. 
102:205-211, 1993); geranium (Suga, T. and Endo, T., Phytochemistry 
35 30:1757-1761, 1991); sage (Croteau, R. and Purkett, P.T., Arch. Biochem. Biophys. 



WSUR\14294AP.DOC 



271:524-535, 1989) and Lithospermum (Heide, L. and Berger, U., Arch. Biochem. 
Biophys. 273:331-338, 1989). Only the enzyme from grape has been purified to 
homogeneity (Clastre etal., supra). The structures and properties of 
prenyltransferase enzymes and genes are reviewed in, K. Ogura and T. Koyama, 
Chem. Rev. 98:1263-1276 (1998), and in T. Koyama and K. Ogura, "Isopentenyl 
Diphosphate Isomerase and Prenyltransferases," in Cane, D.E., ed.. Comprehensive 
Natural Products Chemistry: Isoprenoids, Vol. 2, Elsevier Science, Oxford, 1999, 
pp. 69-96. 

U.S. Patent Serial Number 5,876,964 and copending, international patent 
application PCT/US98/21772, each disclose the isolation and sequence of cDNA 
molecules encoding what was mitially characterized as a geranyl diphosphate 
synthase protein. The putative geranyl diphosphate synthase protein exhibited only a 
small fraction of the biological activity of native geranyl diphosphate synthase 
extracted from plant tissue. In view of the disclosure of the present patent 
application, the putative geranyl diphosphate synthase protein disclosed in U.S. 
Patent Serial Number 5,876,964, and in copending, international patent application 
PCTAJS98/21772, is now known to be the small subunit of geranyl diphosphate 
synthase which exists, in its native, frilly functional form, as a heterodimer including 
a small subunit and a large subunit. The present patent application discloses the 
isolation and sequence of a Mentha cDNA encoding geranyl diphosphate synthase 
large subunit, and enables the isolation of additional nucleic acid molecules encoding 
geranyl diphosphate synthase large subunit. 

Summary of the Invention 
In accordance with the foregoing, a cDNA encoding geranyl diphosphate 
synthase large subunit from peppermint has been isolated and sequenced, and the 
corresponding amino acid sequence has been deduced. Accordingly, the present 
invention relates to isolated, recombinant geranyl diphosphate synthase large subunit 
proteins, to isolated DNA sequences which code for the expression of geranyl 
diphosphate synthase large subunit, such as the sequence designated SEQ IDNOil 
which encodes geranyl diphosphate synthase large subunit (SEQ ID NO:2) from 
peppermint {Mentha piperita). In other aspects, the present invention is directed to 
replicable recombinant cloning vehicles comprising a nucleic acid sequence, e.g., a 
DNA sequence which codes for a geranyl diphosphate synthase large subunit or for a 
base sequence sufficiently complementary to at least a portion of the geranyl 
diphosphate synthase large subunit DNA or RNA to enable hybridization therewith 
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{e.g., antisense geranyl diphosphate synthase large subunit RNA or fragments of 
complementary geranyl diphosphate synthase large subunit DNA which are useful as 
polymerase chain reaction primers or as probes for geranyl diphosphate synthase 
large subunit genes, or related genes). In yet other aspects of the invention, modified 

5 host cells are provided that have been transformed, transfected, infected and/or 
injected with a recombinant cloning vehicle and/or DNA sequence of the invention. 
Thus, the present invention provides for the recombinant expression of geranyl 
diphosphate synthase large subunit. The inventive concepts described herein may be 
used to facilitate the production, isolation and purification of significant quantities of 

10 recombinant geranyl diphosphate synthase large subunit for subsequent use, to obtain 
expression or enhanced expression of geranyl diphosphate synthase large subunit in 
plants, microorganisms or animals, or may be otherwise employed in an environment 
where the regulation or expression of geranyl diphosphate synthase large subunit is 
desired, for example for the production of the enzyme product of geranyl diphosphate 

1 5 synthase heterodimer, geranyl diphosphate, or its derivatives. 

In another aspect, the present invention provides isolated, recombinant 
geranyl diphosphate synthase heterodimer protein comprising an isolated, 
recombinant geranyl diphosphate synthase large subunit protein and an isolated, 
recombinant geranyl diphosphate synthase small subunit protein. 

20 In yet another aspect of the present invention, methods are provided for 

treating cancer. The cancer treatment methods include the step of introducing a 
geranyl diphosphate synthase small subunit protein into a cancer cell, together with a 
monoterpene synthase protein that is capable of converting geranyl diphosphate to a 
monoterpene having anticancer properties. More preferably, nucleic acid sequences 

25 encoding a geranyl diphosphate synthase small subunit protein and a monoterpene 
synthase protein (that is capable of converting geranyl diphosphate to a monoterpene 
having anticancer properties) are introduced into a cancer cell under conditions that 
enable expression of the geranyl diphosphate synthase small subunit and 
monoterpene synthase proteins. It is understood that a single nucleic acid molecule 

30 can encode both the geranyl diphosphate synthase small subunit and monoterpene 
synthase proteins. 

In a presently preferred embodiment, the cancer treatment methods of the 
present invention include the step of introducing a geranyl diphosphate synthase 
large subunit protein and a geranyl diphosphate synthase small subunit protein into a 
35 cancer cell, together with a monoterpene synthase protein that is capable of 



WSUR\14294AP.DOC 



-7- 



' converting geranyl diphosphate to a monoterpene having anticancer properties. More 
preferably, nucleic acid molecules encoding a geranyl diphosphate synthase small 
subimit protein, a geranyl diphosphate synthase large subunit protein and a 
monoterpene synthase protein (that is capable of converting geranyl diphosphate to a 
5 monoterpene having anticancer properties) are introduced into a cancer cell under 
conditions that enable expression of the geranyl diphosphate synthase small subunit, 
large subunit and monoterpene synthase proteins. It is understood that a single 
nucleic acid molecule can encode the geranyl diphosphate synthase small subunit, the 
geranyl diphosphate synthase large subunit and monoterpene synthase proteins. 
10 Brief Description of the Drawings 

The foregoing aspects and many of the attendant advantages of this invention 
will become more readily appreciated as the same become better understood by 
reference to the following detailed description, when taken in conjunction with the 
accompanying drawings, wherein: 
15 FIGURE 1 shows the condensation reactions catalyzed by (a) geranyl 

diphosphate synthase, (b) famesyl diphosphate synthase and (c) geranylgeranyl 
diphosphate synthase. 

FIGURE 2 shows the reaction mechanism common to all prenyltransferases. 
The reaction catalyzed by geranyl diphosphate synthase is presented as illustrative of 
20 the general mechanism. 

Detailed Description of the Preferred Embodiment 
As used herein, the terms "amino acid" and "amino acids" refer to all 
naturally occurring L-a-amino acids or their residues. The amino acids are identified 
by either the single-letter or three-letter designations: 



25 


Asp 


D 


aspartic acid 


He 


I 


isoleucine 




Thr 


T 


threonine 


Leu 


L 


leucine 




Ser 


S 


serine 


Tyr 


Y 


tyrosine 




Glu 


E 


glutamic acid 


Phe 


F 


phenylalanine 




Pro 


P 


proline 


His 


H 


histidine 


30 


Gly 


G 


glycine 


Lys 


K 


lysine 




Ala 


A 


alanine 


Arg 


R 


arginine 




Cys 


C 


cysteine 


Trp 


W 


tryptophan 




_ Val 


V 


valine 


Gin 


Q 


glutamine 




Met 


M 


methionine 


Asn 


N 


asparagine 
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As used herein, the term "nucleotide" means a monomeric unit of DNA or 
RNA containing a sugar moiety (pentose), a phosphate and a nitrogenous 
heterocyclic base. The base is linked to the sugar moiety via the glycosidic carbon 
(r carbon of pentose) and that combination of base and sugar is called a nucleoside. 
5 The base characterizes the nucleotide with the four bases of DNA being adenine 
("A"), guanine ("G"), cytosine ("C") and thymine ("T"). Inosine ("I") is a synthetic 
base that can be used to substitute for any of the four, naturally-occurring bases (A, 
C, G or T). The four RNA bases are A,G,C and uracil ("U"). The nucleotide 
sequences described herein comprise a linear array of nucleotides connected by 
1 0 phosphodiester bonds between the 3' and 5' carbons of adjacent pentoses. 

The term "percent identity" (%I) means the percentage of amino acids or 
nucleotides that occupy the same relative position when two amino acid sequences, 
or two nucleic acid sequences, are aligned side by side. 

The term "percent similarity" (%S) is a statistical measure of the degree of 
15 relatedness of two compared protein sequences. The percent similarity is calculated 
by a computer program that assigns a numerical value to each compared pair of 
amino acids based on observed amino acid replacements in closely related sequences. 
Calculations are made after a best fit alignment of the two sequences has been made 
empirically by iterative comparison of all possible alignments. (Henikoff, S. and 
20 Henikoff, J.G., Proc. Nat'l Acad Sci C/S^ 89 : 1 09 1 5- 1 09 1 9, 1 992). 

"Oligonucleotide" refers to short length single or double stranded sequences 
of deoxyribonucleotides linked via phosphodiester bonds. The oligonucleotides are 
chemically synthesized by known methods and purified, for example, on 
polyacrylamide gels. 

25 The term "geranyl diphosphate synthase" is used herein to mean an enzyme 

capable of catalyzing the condensation of dimethylallyl diphosphate (DMAPP) and 
isopentenyl diphosphate (IPP) to form geranyl diphosphate, the immediate acyclic 
precursor of the monoterpenes, as described herein. In its folly-functional, naturally- 
occurring form, geranyl diphosphate synthase exists as heterodimer composed of a 

30 geranyl diphosphate synthase large subunit and a geranyl diphosphate synthase small 
subunit. 

The term "essential oil plant," or "essential oil plants," refers to a group of 
plant species that produce high levels of mono terpenoid and/or sesquiterpenoid 
and/or diterpenoid oils, and/or high levels of monoterpenoid and/or sesquiterpenoid 
35 and/or diterpenoid resins. The foregoing oils and/or resins account for greater than 
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about 0.005% of the fresh weight of an essential oil plant that produces them. The 
essential oils and/or resins are more fully described, for example, in E. Guenther, The 
Essential Oils, Vols. I-VI, R.E. Krieger Publishing Co., Huntington N.Y., 1975, 
incorporated herein by reference. The essential oil plants include, but are not limited 
5 to: 

Lamiaceae, including, but not limited to, the following species: Ocimum 
(basil), Lavandula (Lavender), Origanum (oregano), Mentha (mint), Salvia (sage), 
Rosmarinus (rosemary), Thymus (thyme), Satureja and Monarda. 

Umbelliferae, including, but not limited to, the following species: Carum 
1 0 (caraway), Anethum (dill), feniculum (fennel) and Daucus (carrot). 

Asteraceae (Compositae), including, but not limited to, the following species: 
Artemisia (tarragon, sage brush), Tanacetum (tansy). 

Rutaceae (e.g., citrus plants); Rosaceae (e.g., roses); Myrtaceae (e.g., 
eucalyptus, Melaleuca); the Graraineae (e.g., Cymbopogon (citronella)); Geranaceae 
15 (Geranium) and certain conifers including Abies (e.g., Canadian balsam), Cedrus 
(cedar). Thuja, Pinus (pines) and Juniperus. 

The range of essential oil plants is more fully set forth in E. Guenther, The 
Essential Oils, Vols. I-VI, R.E. Krieger Publishing Co., Huntington N.Y., 1975, 
which is incorporated herein by reference. 
20 The term "angiosperm" refers to a class of plants that produce seeds that are 

enclosed in an ovary. 

The term "gymnosperm" refers to a class of plants that produce seeds that are 
not enclosed in an ovary. 

The terms "alteration", "amino acid sequence alteration", "variant" and 
25 "amino acid sequence variant" refer to geranyl diphosphate synthase large subunit 
molecules with some differences in their amino acid sequences as compared to native 
geranyl diphosphate synthase large subunit. Ordinarily, the variants will possess at 
least about 70% homology with native geranyl diphosphate synthase large subunit, 
and preferably they will be at least about 80% homologous with native geranyl 
30 diphosphate synthase large subunit. The amino acid sequence variants of geranyl 
diphosphate synthase large subunit falling within this invention possess substitutions, 
deletions, and/or insertions at certain positions. Sequence variants of geranyl 
diphosphate synthase large subunit may be used to attain desired enhanced or 
reduced enzymatic activity, or altered substrate utilization or product distribution of 
35 the geranyl diphosphate synthase heterodimer. 
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Substitutional geranyl diphosphate synthase large subunit variants are those 
that have at least one amino acid residue in the native geranyl diphosphate synthase 
large subunit sequence removed and a different amino acid inserted in its place at the 
same position. The substitutions may be single, where only one amino acid in the 
5 molecule has been substituted, or they may be multiple, where two or more amino 
acids have been substituted in the same molecule. Substantial changes in the activity 
of the geranyl diphosphate synthase large subunit molecule may be obtained by 
substituting an amino acid with a side chain that is significantly different in charge 
and/or structure from that of the native amino acid. This type of substitution would 

10 be expected to affect the structure of the polypeptide backbone and/or the charge or 
hydrophobicity of the molecule in the area of the substitution. 

Moderate changes in the activity of the geranyl diphosphate synthase large 
subunit molecule woxild be expected by substituting an amino acid with a side chain 
that is similar in charge and/or structure to that of the native molecule. This type of 

15 substitution, referred to as a conservative substitution, would not be expected to 
substantially alter either the structure of the polypeptide backbone or the charge or 
hydrophobicity of the molecule in the area of the substitution. 

Insertional geranyl diphosphate synthase large subunit variants are those with 
one or more amino acids inserted immediately adjacent to an amino acid at a 

20 particular position in the native geranyl diphosphate synthase large subunit molecule. 
Immediately adjacent to an amino acid means connected to either the a-carboxy or 
a-amino functional group of the amino acid. The insertion may be one or more 
amino acids. Ordinarily, the insertion will consist of one or two conservative amino 
acids. Amino acids similar in charge and/or structure to the amino acids adjacent to 

25 the site of insertion are defined as -conservative. Alternatively, this invention 
includes insertion of an amino acid with a charge and/or structure that is substantially 
different from the amino acids adjacent to the site of insertion. 

Deletional variants are those where one or more amino acids in the native 
geranyl diphosphate synthase large subunit molecule have been removed. 

30 Ordinarily, deletional variants will have one or two amino acids deleted in a 
particular region of the geranyl diphosphate synthase large subunit molecule. 

The terms "biological activity", "biologically active", "activity" and "active," 
when used with reference to geranyl diphosphate synthase, refer to the ability of the 
geranyl diphosphate synthase heterodimer to condense dimethylallyl diphosphate 

35 (DMAPP) and isopentenyl diphosphate (IPP) to form geranyl diphosphate, as 
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measured in an enzyme activity assay, such as the assay described in Example 1 
below. Amino acid sequence variants of geranyl diphosphate synthase {i.e., amino 
acid sequence variants of either or both of the large subunit or the small subunit) may 
have desirable, altered biological activity including, for example, altered reaction 
5 kinetics, substrate utilization product distribution or other characteristics. 

The terms "DNA sequence encoding", "DNA encoding" and "nucleic acid 
encoding" refer to the order or sequence of deoxyribonucleotides along a strand of 
deoxyribonucleic acid. The order of these deoxyribonucleotides determines the order 
of amino acids along the translated polypeptide chain. The DNA sequence thus 
10 codes for the amino acid sequence. 

The terms "replicable expression vector" and "expression vector" refer to a 
piece of DNA, usually double-stranded, which may have inserted into it a piece of 
foreign DNA. Foreign DNA is defmed as heterologous DNA, which is DNA not 
naturally found in the host. The vector is used to transport the foreign or 
15 heterologous DNA into a suitable host cell. Once in the host cell, the vector can 
replicate independently of or coincidental with the host chromosomal DNA, and 
several copies of the vector and its inserted (foreign) DNA may be generated. In 
addition, the vector contains the necessary elements that permit translating the 
foreign DNA into a polypeptide. Many molecules of the polypeptide encoded by the 
20 foreign DNA can thus be rapidly synthesized. 

The terms "transformed host cell," "transformed" and "transformation" refer 
to the introduction of DNA into a cell. The cell is termed a "host cell", and it may be 
a prokaryotic or a eukaryotic cell. Typical prokaryotic host cells include various 
strains of E. coli. Typical eukaryotic host cells are plant cells, such as maize cells, 
25 yeast cells, insect cells or animal cells. The introduced DNA is usually in the form of 
a vector containing an inserted piece of DNA. The introduced DNA sequence may 
be from the same species as the host cell or from a different species from the host 
cell, or it may be a hybrid DNA sequence, containing some foreign DNA and some 
DNA derived from the host species. 
30 In accordance with the present invention, a cDNA encoding geranyl 

diphosphate synthase large subunit was isolated and sequenced in the following 
manner. Geranyl diphosphate synthase large subunit is located exclusively in the 
glandular trichome secretory cells and interacts with geranyl diphosphate synthase 
small subunit to form geranyl diphosphate synthase which catalyzes the formation of 
35 geranyl diphosphate in these essential oil species. These secretory cell clusters were 
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isolated from Mentha spicata and geranyl diphosphate synthase large subunit was 
purified therefrom utilizing a purification protocol consisting of a dye-ligand 
chromatography step, and an anion exchange chromatography step followed by 
preparative SDS-PAGE. The limited amount of purified geranyl diphosphate 
5 synthase large subunit yielded six peptide fragments when digested with trypsin, and 
the sequence of four of these peptide fragments was determined, peptide 1 (SEQ ID 
N0:3), peptide 2 (SEQ ID N0:4), peptide 3 (SEQ ID N0:5) and peptide 4 (SEQ ID 
NO: 6). This sequence information was insufficient to permit a reverse genetic 
approach to cloning the geranyl diphosphate synthase large subunit cDNA, i.e., there 

10 was insufficient amino acid sequence to permit the construction of degenerate 
oligonucleotide probes that were sufficiently specific to be effective as probes (these 
sequences were too degenerate to permit the design of specific PGR primers). 

Consequently, total RNA was extracted from isolated trichome secretory cells 
derived from Mentha piperita and mRNA was purified therefrom. The secretory cell 

15 mRNA served as the substrate for the synthesis of a cDNA library by standard 
means. One hundred and thirty, randomly selected cDNA clones were sequenced 
and one clone showed homology to plant-derived geranylgeranyl diphosphate 
synthases (-67-83% identity; -74-93% similarity). Sequence information derived 
from this "prenyltransferase-like" cDNA was used to construct PGR primers GG23F 

20 (SEQ ID NO:7) and GG23R (SEQ ID N0:8) which were, in turn, used to amplify a 
101 bp fragment (SEQ ID N0:9) of the 5'-end of the geranylgeranyl diphosphate 
synthase-like cDNA. The 101 bp fragment (SEQ ID NO: 9) was radiolabelled and 
used as a probe to screen a mint oil gland cDNA library. Ten positive clones were 
purified through a second round of screening and were sequenced to yield the full- 

25 length cDNA insert of pMp23.10 (SEQ ID N0:1). This clone was initially 
considered to encode geranylgeranyl diphosphate synthase, but the expressed protein 
(SEQ ID NO:2) did not yield a functional GGPP synthase; however, aligrmient of the 
four peptide sequences, derived firom the purified 37 kDa protein, obtained by 
purification of mint geranyl diphosphate synthase, with the deduced amino acid 

30 sequence of pMp23.10 (SEQ ID NO:2), revealed that peptide 1 (LIGVE) (SEQ ID 
NO:3) corresponded exactly to deduced amino acid residues 333 to 337 of SEQ ID 
NO:2, that peptide 2 (YIAYR) (SEQ ID N0:4) corresponded exactly to deduced 
amino acid residues 371 to 375 of SEQ ID NO:2, that peptide 3 
(TAALLTGSVVLGAIL) (SEQ ID N0:5) corresponded to residues 263 to 277 of 

35 SEQ ID NO:2 and peptide 4 (EAVETLLHF) (SEQ ID N0:6) to residues 349-357 of 
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SEQ ID NO:2. These results suggested that the nucleic acid molecule of SEQ ID 
N0:1 encoded a GPP synthase. 

Celi-free extracts of bacteria harboring both a geranyl diphosphate synthase 
large subunit clone (SEQ ID NO:l) and a geranyl diphosphate synthase small subunit 
clone (SEQ ID NO: 10), encoding the protein of SEQ ID NO: 11 (the geranyl 
diphosphate synthase small subunit clone is designated Mpl3.18, and is folly 
disclosed and described m U.S. Patent Serial Number 5,876,964 and m copending, 
international patent application PCT/US98/21772, both of which are incorporated 
herem by reference) yielded levels of prenyltransferase activity significantly higher 
than the corresponding empty vector controls, and separation of activities by ion- 
exchange chromatography revealed the presence of a prenyltransferase that eluted at 
>90 mM KCl and that was absent in preparations from the controls. This new, 
recombinant prenyltransferase was confirmed to be geranyl diphosphate synthase by 
radio-gas chromatographic analysis demonstrating the exclusive production of the 
C|o product. 

The isolation of a cDNA encodmg geranyl diphosphate synthase large subunit 
permits the development of an efficient expression system for this protein, provides a 
usefol tool for examining the developmental regulation of monoterpene biosynthesis 
and permits the isolation of other geranyl diphosphate synthase large subunits. The 
isolation of a geranyl diphosphate synthase large subunit cDNA also permits the 
transformation of a wide range of organisms in order to introduce monoterpene 
biosynthesis de novo, or to modify endogenous monoterpene biosynthesis. Further, 
the isolation of a geranyl diphosphate synthase large subunit cDNA also permits 
coexpression of the large and small subunits of geranyl diphosphate synthase in a 
host cell to form fully functional, recombinant geranyl diphosphate synthase 
heterodimer. 

Although the geranyl diphosphate synthase large subimit protein set forth in 
SEQ ID N0:2 directs the enzyme to plastids, substitution of the putative targeting 
sequence (SEQ ID N0:2, amino acids 1 to 48 ) witli other transport sequences well 
known in the art (see, e.g., von Heijne G et al., Eur. J. Biochem 180: 535-545, 1989; 
Stryer, Biochemistry W.H. Freeman and Company, New York, NY, p. 769 [1988]) 
may be employed to direct tlie geranyl diphosphate synthase large subunit to other 
cellular or extracellular locations. 

In addition to the native geranyl diphosphate synthase large subunit amino 
acid sequence of SEQ lDNO:2 encoded by the cDNA insert of plasmid Mp 23.10 
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(SEQIDN0:1), sequence variants produced by deletions, substitutions, mutations 
and/or insertions are intended to be within the scope of the invention except insofar 
as limited by the prior art. Geranyl diphosphate synthase large subunit amino acid 
sequence variants may be constructed by mutating the DNA sequence that encodes 
5 wild-type geranyl diphosphate synthase large subunit, such as by using techniques 
commonly referred to as site-directed mutagenesis. Various polymerase chain 
reaction (PGR) methods now well known in the field, such as a two primer system 
like the Transformer Site-Directed Mutagenesis kit from Clontech, may be employed 
for this purpose. 

10 Following denaturation of the target plasmid in this system, two primers are 

O simultaneously annealed to the plasmid; one of these primers contains the desired 

% site-directed mutation, the other contains a mutation at another point in the plasmid 

ly resulting in elimination of a restriction site. Second strand synthesis is then carried 

I;*' out, tightly linking these two mutations, and the resulting plasmids are transformed 

SJ 1 5 into a mutS strain of E. coli. Plasmid DNA is isolated from the transformed bacteria, 
^ restricted with the relevant restriction enzyme (thereby linearizing the unmutated 

:f=l plasmids), and then retransformed into E. coli. This system allows for generation of 

5© mutations directly in an expression plasmid, without the necessity of subcloning or 

generation of single-stranded phagemids. The tight linkage of the two mutations and 
f3 20 the subsequent linearization of unmutated plasmids results in high mutation 

efficiency and allows minimal screening. Following synthesis of the initial 
restriction site primer, this method requires the use of only one new primer type per 
mutation site. Rather than prepare each positional mutant separately, a set of 
"designed degenerate" oligonucleotide primers can be synthesized in order to 
25 introduce all of the desired mutations at a given site simultaneously. Transformants 
can be screened by sequencing the plasmid DNA through the mutagenized region to 
identify and sort mutant clones. Each mutant DNA can then be restricted and 
•analyzed by electrophoresis on Mutation Detection Enhancement gel (J.T. Baker) to 
confirm that no other alterations in the sequence have occurred (by band shift 
30 comparison to the unmutagenized control). 

The verified mutant duplexes can be cloned into a replicable expression 
vector, if not already cloned into a vector of this type, and the resulting expression 
construct used to transform E. coli, such as strain E. coli BL21(DE3)pLysS, for high 
level production of the mutant protein, and subsequent purification thereof The 
35 method of FAB-MS mapping can be employed to rapidly check the fidelity of mutant 
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expression. This technique provides for sequencing segments throughout the whole 
protein and provides the necessary confidence in the sequence assignment. In a 
mapping experiment of this type, protein is digested with a protease (the choice will 
depend on the specific region to be modified since this segment is of prime interest 
5 and the remaining map should be identical to the map of unmutagenized protein). 
The set of cleavage fragments is fractionated by microbore HPLC (reversed phase or 
ion exchange, again depending on the specific region to be modified) to provide 
several peptides in each fraction, and the molecular weights of the peptides are 
determined by FAB-MS. The masses are then compared to the molecular weights of 

1 0 peptides expected from the digestion of the predicted sequence, and the correctness 
of the sequence quickly ascertained. Since this mutagenesis approach to protein 
modification is directed, sequencing of the altered peptide should not be necessary if 
the MS agrees with prediction. If necessary to verify a changed residue, 
CAD-tandem MS/MS can be employed to sequence the peptides of the mixture in 

15 question, or the target peptide purified for subtractive Edman degradation or 
carboxypeptidase Y digestion depending on the location of the modification. 

In the design of a particular site directed mutant, it is generally desirable to 
first make a non-conservative substitution (e.g., Ala for Cys, His or Glu) and 
determine if activity is greatly impaired as a consequence. The properties of the 

20 mutagenized protein are then examined with particular attention to the kinetic 
parameters of and k^^i as sensitive indicators of altered function, from which 
changes in binding and/or catalysis per se may be deduced by comparison to the 
native enzyme. If the residue is by this means demonstrated to be important by 
activity impairment, or knockout, then conservative substitutions can be made, such 

25 as Asp for Glu to alter side chain length, Ser for Cys, or Arg for His. For 
hydrophobic segments, it is largely size that will be altered, although aromatics can 
also be substituted for alkyl side chains. Changes in the normal product distribution 
can indicate which step(s) of the reaction sequence have been altered by the 
mutation. 

30 Other site directed mutagenesis techniques may also be employed with the 

nucleotide sequences of the invention. For example, restriction endonuclease 
digestion of DNA followed by ligation may be used to generate geranyl diphosphate 
synthase large subunit deletion variants, as described in section 15.3 of 
Sambrook et al. {Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring 
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Harbor Laboratory Press, New York, NY [1989]). A similar strategy may be used to 
construct insertion variants, as described in section 15.3 of Sambrook et al., supra. 

Oligonucleotide-directed mutagenesis may also be employed for preparing 
substitution variants of this invention. It may also be used to conveniently prepare 
5 the deletion and insertion variants of this invention. This technique is well known in 
the art as described by Adelmanetal. {DNA 2:183 [1983]). Generally, 
oligonucleotides of at least 25 nucleotides in length are used to insert, delete or 
substitute two or more nucleotides in the geranyl diphosphate synthase large subunit 
molecule. An optimal oligonucleotide will have 12 to 15 perfectly matched 

1 0 nucleotides on either side of the nucleotides coding for the mutation. To mutagenize 
the wild-type geranyl diphosphate synthase large subunit, the oligonucleotide is 
annealed to the single-stranded DNA template molecule under suitable hybridization 
conditions. A DNA polymerizing enzyme, usually the Klenow fragment of E. coli 
DNA polymerase 1, is then added. This enzyme uses the oligonucleotide as a primer 

15 to complete the synthesis of the mutation-bearing strand of DNA. Thus, a 
heteroduplex molecule is formed such that one strand of DNA encodes the wild-type 
geranyl diphosphate synthase large subunit inserted in the vector, and the second 
strand of DNA encodes the mutated form of geranyl diphosphate synthase large 
subunit inserted into the same vector. This heteroduplex molecule is then 

20 transformed into a suitable host cell. 

Mutants with more than one amino acid substituted may be generated in one 
of several ways. If the amino acids are located close together in the polypeptide 
chain, they may be mutated simultaneously using one oligonucleotide that codes for 
all of the desired amino acid substitutions. If however, the amino acids are located 

25 some distance from each other (separated by more than ten amino acids, for example) 
it is more difficult to generate a single oligonucleotide that encodes all of the desired 
changes. Instead, one of two alternative methods may be employed. In the first 
method, a separate oligonucleotide is generated for each amino acid to be substituted. 
The oligonucleotides are then armealed to the single-stranded template DNA 

30 simultaneously, and the second strand of DNA that is synthesized from the template 
will encode all of the desired amino acid substitutions. An alternative method 
involves two or more rounds of mutagenesis to produce the desired mutant. The first 
round is as described for the single mutants: wild-type geranyl diphosphate synthase 
large subunit DNA is used for the template, an oligonucleotide encoding the first 

35 desired amino acid substitution(s) is annealed to this template, and the heteroduplex 
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DNA molecule is then generated. The second round of mutagenesis utilizes the 
mutated DNA produced in the first roxmd of mutagenesis as the template. Thus, this 
template already contains one or more mutations. The oligonucleotide encoding the 
additional desired amino acid substitution(s) is then annealed to this template, and 
5 the resulting strand of DNA now encodes mutations from both the first and second 
roimds of mutagenesis. This resultant DNA can be used as a template in a third 
round of mutagenesis, and so on. 

The gene, or other nucleic acid molecule, encoding geranyi diphosphate 
synthase large subunit may be incorporated, together with a nucleic acid molecule 

10 encoding geranyi diphosphate synthase small subunit (separately or operationally 
linked), into any organism (intact plant, animal, microbe), or cell culture derived 
therefrom, that produces dimethylallyl diphosphate and isopentenyl diphosphate to 
effect the conversion of these primary substrates to geranyi diphosphate and its 
subsequent metabolic products, depending on the organism. The geranyi 

15 diphosphate synthase large subunit gene, together with a nucleic acid molecule 
encoding geranyi diphosphate synthase small subunit, may be introduced into any 
organism for a variety of purposes including, but not limited to: production or 
modification of flavor and aroma properties; improvement of defense capability; the 
alteration of other ecological interactions mediated by geranyi diphosphate and its 

20 derivatives; selective destruction or inhibition of the growth, development or division 
of cancerous cells; or the production of geranyi diphosphate and its derivatives. 

Eukaryotic expression systems may be utilized for geranyi diphosphate 
synthase large subunit production since they are capable of carrying out any required 
posttranslational modifications and of directing the enzyme to the proper membrane 

25 location. A representative eukaryotic expression system for this purpose uses the 
recombinant bacuio virus, Autographa califomica nuclear polyhedrosis virus 
(AcNPV; M.D. Summers and G.E. Smith, A Manual of Methods for Baculovirus 
Vectors and Insect Cell Culture Procedures [1986]; Luckow et al.. Bio-technology 
6:47-55 [1987]) for expression of the geranyi diphosphate synthase large subunit of 

30 the invention. Infection of insect cells (such as cells of the species Spodoptera 
frugiperda) with the recombinant baculoviruses allows for the production of large 
amounts of the geranyi diphosphate synthase large subunit protein. In addition, the 
baculovirus system has other important advantages for the production of recombinant 
geranyi diphosphate synthase large subunit. For example, baculoviruses do not infect 

35 humans and can therefore be safely handled in large quantities. In the baculovirus 
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system, a DNA construct is prepared including a DNA segment encoding geranyl 
diphosphate synthase large subunit and a vector. The vector may comprise the 
polyhedron gene promoter region of a baculovirus, the baculovirus flanking 
sequences necessary for proper cross-over during recombination (the flanking 
5 sequences comprise about 200-300 base pairs adjacent to the promoter sequence) and 
a bacterial origin of replication which permits the construct to replicate in bacteria. 
The vector is constructed so that (i) the DNA segment is placed adjacent (or operably 
linked or "downstream" or "under the control of) to the polyhedron gene promoter 
and (ii)the promoter/geranyl diphosphate synthase large subunit (and/or small 

10 subunit) combination is flanked on both sides by 200-300 base pairs of baculovirus 
DNA (the flanking sequences). 

To produce the geranyl diphosphate synthase large subunit DNA construct, a 
cDNA clone encoding the full length geranyl diphosphate synthase large subunit is 
obtained using methods such as those described herein. The DNA construct is 

15 contacted in a host cell with baculovirus DNA of an appropriate baculovirus (that is, 
of the same species of baculovirus as the promoter encoded in the construct) under 
conditions such that recombination is effected. The resulting recombinant 
baculoviruses encode the full geranyl diphosphate synthase large subunit. For 
example, an insect host cell can be cotransfected or transfected separately with the 

20 DNA construct and a functional baculovirus. Resulting recombinant baculoviruses 
can then be isolated and used to infect cells to effect production of the geranyl 
diphosphate synthase large subunit. Host insect cells include, for example, 
Spodoptera frugiperda cells, that are capable of producing a baculovirus-expressed 
geranyl diphosphate synthase large subunit. Insect host cells infected with a 

25 recombinant baculovirus of the present invention are then cultured under conditions 
allowing expression of the baculovirus-encoded geranyl diphosphate synthase large 
subunit. Geranyl diphosphate synthase large subunit thus produced is then extracted 
from the cells using methods known in the art. 

Other eukaryotic microbes such as yeasts may also be used to practice this 

30 invention. The baker's yeast Saccharomyces cerevisiae, is a commonly used yeast, 
although several other types are available. The plasmid YRp7 (Stinchcomb et al.. 
Nature 282:39 [1979]; Kingsman et al.. Gene 7:141 [1979]; Tschemper et al.. Gene 
10:157 [1980]) is commonly used as an expression vector in Saccharomyces. This 
plasmid contains the trpl gene that provides a selection marker for a mutant strain of 

35 yeast lacking the ability to grow in the absence of tryptophan, such as strains ATCC 
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No. 44,076 and PEP4-1 (Jones, Genetics 85:12 [1977]). The presence of the trpl 
lesion as a characteristic of the yeast host cell genome then provides an effective 
environment for detecting transformation by growth in the absence of tryptophan. 
Yeast host cells are generally transformed using the polyethylene glycol method, as 
5 described by Hinnen {Proc. Natl. Acad. Sci. USA 75:1929 [1978]. Additional yeast 
transformation protocols are set forth in Gietz et al., N.A.R. 20(17): 1425, 1992; 
Reeves et al, FEMS 99:193-197, 1992. 

Suitable promoting sequences in yeast vectors include the promoters for 
3-phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem. 255:2073 [1980]) or other 

10 glycolytic enzymes (HessetaL, J. Adv. Enzyme Reg. 7:149 [1968]; Holland et al.. 
Biochemistry 17:4900 [1978]), such as enolase, glyceraldehyde-3 -phosphate 
dehydrogenase, hexokinase, pyruvate decarboxylase,' phosphofructokinase, 
glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triose- 
phosphate isomerase, phosphoglucose isomerase, and glucokinase. In the 

15 construction of suitable expression plasmids, the termination sequences associated 
with these genes are also ligated into the expression vector 3' of the sequence desired 
to be expressed to provide polyadenylation of the mRNA and termination. Other 
promoters that have the additional advantage of transcription controlled by growth 
conditions are the promoter region for alcohol dehydrogenase 2, isocytochrome C, 

20 acid phosphatase, degradative enzymes associated with nitrogen metabolism, and the 
aforementioned glyceraIdehyde-3-phosphate dehydrogenase, and enzymes 
responsible for maltose and galactose utilization. Any plasmid vector containing 
yeast-compatible promoter, origin of replication and termination sequences is 
suitable. 

25 Cell cultures derived from multicellular organisms, such as plants, may be 

used as hosts to practice this invention. Transgenic plants can be obtained, for 
example, by transferring plasmids that encode geranyl diphosphate synthase large 
subunit and a selectable marker gene, e.g., the kan gene encoding resistance to 
kanamycin, into Agrobacterium tumifaciens containing a helper Ti piasmid as 

30 described in Hoeckema et al.. Nature 303:179-181 [1983] and culturing the 
Agrobacterium cells with leaf slices of the plant to be transformed as described by 
An et al.. Plant Physiology 81:301-305 [1986]. Transformation of cultured plant 
host cells is normally accomplished through Agrobacterium tumifaciens, as described 
above. Cultures of mammalian host cells and other host cells that do not have rigid 

35 cell membrane barriers are usually transformed using the calcium phosphate method 
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as originally described by Graham and VanderEb {Virology 52:546 [1978]) and 
modified as described in sections 16.32-16.37 of Sambrook et al., supra. However, 
other methods for introducing DNA into cells such as Polybrene (Kawai and 
Nishizawa, Mol. Cell Biol. 4:1172 [1984]), protoplast fusion (Schaffner, Proc. Natl. 
5 Acad. Set USA 77:2163 [1980]), electroporation (Neumann et al., EMBOJ. 1:841 
[1982]), and direct microinjection into nuclei (Capecchi, Cell 22:479 [1980]) may 
also be used. Additionally, animal transformation strategies are reviewed in 
Monastersky G.M. and Robl, J.M., Strategies in Transgenic Animal Science, ASM 
Press, Washington, D.C., 1995. Transformed plant calli may be selected through the 

10 selectable marker by growing the cells on a medium containing, e.g., kanamycin, and 
appropriate amounts of phytohormone such as naphthalene acetic acid and 
benzyladenine for callus and shoot induction. The plant cells may then be 
regenerated and the resulting plants transferred to soil using techniques well known 
to those skilled in the art. 

15 In addition, a gene regulating geranyl diphosphate synthase large subunit 

production can be incorporated into the plant along with a necessary promoter which 
is inducible. In the practice of this embodiment of the invention, a promoter that 
only responds to a specific external or internal stimulus is fused to the target cDNA. 
Thus, the gene will not be transcribed except in response to the specific stimulus. As 

20 long as the gene is not being transcribed, its gene product is not produced. 

An illustrative example of a responsive promoter system that can be used in 
the practice of this invention is the glutathione-S-transferase (GST) system in maize. 
GSTs are a family of enzymes that can detoxify a number of hydrophobic 
electrophilic compounds that often are used as pre-emergent herbicides 

25 (Weigand et al., Plant Molecular Biology 7:235-243 [1986]). Studies have shown 
that the GSTs are directly involved in causing this enhanced herbicide tolerance. 
This action is primarily mediated through a specific 1.1 kb mRNA transcription 
product. In short, maize has a naturally occurring quiescent gene already present that 
can respond to external stimuli and that can be induced to produce a gene product. 

30 This gene has previously been identified and cloned. Thus, in one embodiment of 
this invention, the promoter is removed from the GST responsive gene and attached 
to a geranyl diphosphate synthase large subunit gene that previously has had its 
native promoter removed. This engineered gene is the combination of a promoter 
that responds to an external chemical stimulus and a gene responsible for successful 

35 production of geranyl diphosphate synthase large subunit. 
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In addition to the methods described above, several methods are known in the 
art for transferring cloned DNA into a wide variety of plant species, including 
gymnosperms, angiosperms, monocots and dicots (see, e.g., Glick and 
Thompson, eds., Methods in Plant Molecular Biology, CRC Press, Boca Raton, 
Florida [1993], incorporated by reference herein). Representative examples include 
electroporation-facilitated DNA uptake by protoplasts in which an electrical pulse 
transiently permeabilizes cell membranes, permitting the uptake of a variety of 
biological molecules, including recombinant DNA (Rhodes et al.. Science, 
240:204-207 [1988]); treatment of protoplasts with polyethylene glycol 
(Lyznik et al.. Plant Molecular Biology, 13:151-161 [1989]); and bombardment of 
cells with DNA-laden microprojectiles which are propelled by explosive force or 
compressed gas to penetrate the cell wall (Klein etal., Plant Physiol. 91:440-444 
[1989] and Boynton et al., Science, 240:1534-1538 [1988]). Transformation of 
Taxus species can be achieved, for example, by employing the methods set forth in 
Han et al. Plant Science, 95:187-196 (1994), incorporated by reference herein. A 
method that has been applied to Rye plants (Secale cereale) is to directly inject 
plasmid DNA, including a selectable marker gene, into developing floral tillers (de la 
Pena et al. Nature 325:274-276 (1987)). Further, plant viruses can be used as 
vectors to transfer genes to plant cells. Examples of plant viruses that can be used as 
vectors to transform plants include the Cauliflower Mosaic Virus (Brisson et al., 
Nature 310: 511-514 (1984); Additionally, plant transformation strategies and 
techniques are reviewed in Birch, R.G., Ann Rev Plant Phys Plant Mol Biol, 48:297 
(1997); Forester etal., Exp. Agric, 33:15-33 (1997). The aforementioned 
publications disclosing plant transformation techniques are incorporated herein by 
reference, and minor variations make these technologies applicable to a broad range 
of plant species. 

Each of these techniques has advantages and disadvantages. In each of the 
techniques, DNA from a plasmid is genetically engineered such that it contains not 
only the gene of interest, but also selectable and screenable marker genes. A 
selectable marker gene is used to select only those cells that have integrated copies of 
the plasmid (the construction is such that the gene of interest and the selectable and 
screenable genes are transferred as a unit). The screenable gene provides another 
check for the successful culturing of only those cells carrying the genes of interest. A 
commonly used selectable marker gene is neomycin phosphotransferase II (NPT II). 
This gene conveys resistance to kanamycin, a compound that can be added directly to 



-22- 



the growth media on which the cells grow. Plant cells are normally susceptible to 
kanamycin and, as a result, die. The presence of the NPT II gene overcomes the 
effects of the kanamycin and each cell with this gene remains viable. Another 
selectable marker gene which can be employed in the practice of this invention is the 
5 gene which confers resistance to the herbicide glufosinate (Basta). A screenable gene 
commonly used is the (3-glucuronidase gene (GUS). The presence of this gene is 
characterized using a histochemical reaction in which a sample of putatively 
transformed cells is treated with a GUS assay solution. After an appropriate 
incubation, the cells containing the GUS gene tum blue. Preferably, the plasmid will 

1 0 contain both selectable and screenable marker genes. 

The plasmid containing one or more of these genes is introduced into either 
plant protoplasts or callus cells by any of the previously mentioned techniques. If the 
marker gene is a selectable gene, only those cells that have incorporated the DNA 
package survive under selection with the appropriate phytotoxic agent. Once the 

1 5 appropriate cells are identified and propagated, plants are regenerated. Progeny from 
the transformed plants must be tested to insure that the DNA package has been 
successfully integrated into the plant genome. 

Mammalian host ceils may also be used in the practice of the invention. 
Examples of suitable mammalian cell lines include monkey kidney CVI line 

20 transformed by SV40 (COS-7, ATCC CRL 165.1); human embryonic kidney line 
293S (Graham et al, J. Gen. Virol. 36:59 [1977]); baby hamster kidney cells (BHK, 
ATCC CCL 10); Chinese hamster ovary cells (Uriah and Chasin, Proc. Natl. Acad. 
Sci USA 77:4216 [1980]); mouse Sertoli cells (TM4, Mather, Biol. Reprod. 23:243 
[1980]); monkey kidney cells (CVI-76, ATCC CCL 70); African green monkey 

25 kidney cells (VERO-76, ATCC CRL- 1587); human cervical carcinoma cells (HELA, 
ATCC CCL 2); canine kidney cells (MDCK, ATCC CCL 34); buffalo rat liver cells 
(BRL3A, ATCC CRL 1442); human lung cells (W138, ATCC CCL 75); human 
liver cells (Hep G2, HB 8065); mouse mammary tumor cells (MMT 060562, 
ATCC CCL 51); rat hepatoma cells (HTC, MI.54, Baumann et al., J. Cell Biol. 85:1 

30 [1980]); and TRI cells (Mather et al.. Annals N.Y. Acad. Sci. 383:44 [1982]). 
Expression vectors for these cells ordinarily include (if necessary) DNA sequences 
for an origin of replication, a promoter located in front of the gene to be expressed, a 
ribosome binding site, an RNA splice site, a polyadenylation site, and a transcription 
terminator site. 
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Promoters used in mammalian expression vectors are often of viral origin. 
These viral promoters are commonly derived from polyoma virus, Adenovirus 2, and 
most frequently Simian Virus 40 (SV40). The SV40 virus contains two promoters 
that are termed the early and late promoters. These promoters are particularly useful 
5 because they are both easily obtained from the virus as one DNA fragment that also 
contains the viral origin of replication (Tiers et al.. Nature 273:1 13 [1978]). Smaller 
or larger SV40 DNA fi-agments may also be used, provided they contain the 
approximately 250-bp sequence extending from the Hindlll site toward the Bgll site 
located in the viral origin of replication. 

10 Alternatively, promoters that are naturally associated with the foreign gene 

(homologous promoters) may be used provided that they are compatible with the host 
cell line selected for transformation. 

An origin of replication may be obtained from an exogenous source, such as 
SV40 or other virus (e.g.. Polyoma, Adeno, VSV, BPV) and inserted into the cloning 

15 vector. Alternatively, the origin of replication may be provided by the host cell 
chromosomal replication mechanism. If the vector containing the foreign gene is 
integrated into the host cell chromosome, the latter is often sufficient. 

The use of a secondary DNA coding sequence can enhance production levels 
of geranyl diphosphate synthase large subunit in transformed ceil lines. The 

20 secondary coding sequence typically comprises the enzyme dihydrofolate reductase 
(DHFR). The wild-type form of DHFR is normally inhibited by the chemical 
methotrexate (MTX). The level of DHFR expression in a cell will vary depending on 
the amount of MTX added to the cultured host cells. An additional feature of DHFR 
that makes it particularly useful as a secondary sequence is that it can be used as a 

25 selection marker to identify transformed cells. Two forms of DHFR are available for 
use as secondary sequences, wild-type DHFR and MTX-resistant DHFR. The type 
of DHFR used in a particular host cell depends on whether the host cell is DHFR 
deficient (such that it either produces very low levels of DHFR endogenously, or it 
does not produce functional DHFR at all). - DHFR-deficient cell lines such as the 

30 CHO cell line described by Urlaub and Chasin, supra, are transformed with wild- 
type DHFR coding sequences. After transformation, these DHFR-deficient cell lines 
express functional DHFR and are capable of growing in a culture medium lacking the 
nutrients hypoxanthine, glycine and thymidine. Nontransformed cells will not 
survive in this medium. 
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The MTX-resistant form of DHFR can be used as a means of selecting for 
transformed host cells in those host cells that endogenousiy produce normal amounts 
of functional DHFR that is MTX sensitive. The CHO-Kl cell line (ATCC 
No. CL61) possesses these characteristics, and is thus a useful cell line for this 
5 purpose. The addition of MTX to the cell culture medium will permit only those 
cells transformed with the DNA encoding the MTX-resistant DHFR to grow. The 
nontransformed cells will be unable to survive in this medium. 

Prokaryotes may also be used as host cells for the initial cloning steps of this 
invention. They are particularly useful for rapid production of large amounts of 

10 DNA, for production of single-stranded DNA templates used for site-directed 
mutagenesis, for screening many mutants simultaneously, and for DNA sequencing 
. of the mutants generated. Suitable pro kary otic host cells include E. coli K12 strain 
294 (ATCC No. 31,446), E. coli strain W3110 (ATCC No. 27,325) E. coli X1776 
(ATCC No. 31,537), and E. coli B; however many other strains of E. coli, such as 

15 HBlOl, JMlOl, NM522, NM538, NM539, and many other species and genera of 
prokaryotes including bacilli such as Bacillus subtilis, other enterobacteriaceae such 
as Salmonella typhimurium or Serratia marcesans, and various Pseudomonas species 
may all be used as hosts. Prokaryotic host cells or other host cells with rigid cell 
walls are preferably transformed using the calcium chloride method as described in 

20 section 1.82 of Sambrook et al., supra. Alternatively, electroporation may be used 
for transformation of these cells. Prokaryote transformation techniques are set forth 
in Dower, W. J., in Genetic Engineering, Principles and Methods, 12:275-296, 
Plenum Publishing Corp., 1990; Hanahan et al, Meth. EfvcymoL, 204:63, 1991. 

As a representative example, cDNA sequences encoding geranyl diphosphate 

25 synthase large subunit may be transferred to the (His)5«Tag pET vector commercially 
available (from Novagen) for overexpression in E. coli as heterologous host. This 
pET expression plasmid has several advantages in high level heterologous expression 
systems. The desired cDNA insert is ligated in frame to plasmid vector sequences 
encoding six histidines followed by a highly specific protease recognition site 

30 (thrombin) that are joined to the amino terminus codon of the target protein. The 
histidine "block" of the expressed fusion protein promotes very tight binding to 
immobilized metal ions and permits rapid purification of the recombinant protein by 
immobilized metal ion affinity chromatography. The histidine leader sequence is 
then cleaved at the specific proteolysis site by treatment of the purified protein with 

35 thrombin, and the geranyl diphosphate synthase large subunit again purified by 
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imtnobilized metal ion affinity chromatography, this time using a shallower 
imidazole gradient to elute the recombinant synthase while leaving the histidine 
block still adsorbed. This overexpression-purification system has high capacity, 
excellent resolving power and is fast, and the chance of a contaminating E. coli 
5 protein exhibiting similar binding behavior (before and after thrombin proteolysis) is 
extremely small. 

As will be apparent to those skilled in the art, any plasmid vectors containing 
replicon and control sequences that are derived from species compatible with the host 
cell may also be used in the practice of the invention. The vector usually has a 

1 0 replication site, marker genes that provide phenotypic selection in transformed cells, 
one or more promoters, and a polylinker region containing several restriction sites for 
insertion of foreign DNA. Plasmids typically- used for transformation of E. coli 
include pBR322, pUC18, pUC19, pUCI18, pUC119, and Bluescript M13, all of 
which are described in sections 1.12-1.20 of Sambrook et al., supra. However, many 

15 other suitable vectors are available as well. These vectors contain genes coding for 
ampicillin and/or tetracycline resistance which enables cells transformed with these 
vectors to grow in the presence of these antibiotics. 

The promoters most commonly used in prokaryotic vectors include the 
P-lactaraase (penicillinase) and lactose promoter systems (Chang et al. Nature 

20 375:615 [1978]; Itakura et al.. Science 198:1056 [1977]; Goeddel et al.. Nature 
281:544 [1979]) and a tryptophan (trp) promoter system (Goeddel et al., Nucl. Acids 
Res. 8:4057 [1980]; EPO Appl. Publ. No. 36,776), and the alkaline phosphatase 
systems. While these are the most commonly used, other microbial promoters have 
been utilized, and details concerning their nucleotide sequences have been published, 

25 enabling a skilled worker to ligate them functionally into plasmid vectors (see 
Siebenlist et al.. Cell 20:269 [1980]). 

Many eukaryotic proteins normally secreted from the cell contain an 
endogenous secretion signal sequence as part of the amino acid sequence. Thus, 
proteins normally found in the cytoplasm can be targeted for secretion by linking a 

30 signal sequence to the protein. This is readily accomplished by ligating DNA 
encoding a signal sequence to the 5' end of the DNA encoding the protein and then 
expressing this fusion protein in an appropriate host cell. The DNA encoding the 
signal sequence may be obtained as a restriction fragment from any gene encoding a 
protein with a signal sequence. Thus, prokaryotic, yeast, and eukaryotic signal 

35 sequences may be used herein, depending on the type of host cell utilized to practice 
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the invention. The DNA and amino acid sequence encoding the signal sequence 
portion of several eukaryotic genes including, for example, human growth hormone, 
proinsulin, and proalbumin are known (see Stryer, Biochemistry W.H. Freeman and 
Company, New York, NY, p. 769 [1988]), and can be used as signal sequences in 
5 appropriate eukaryotic host cells. Yeast signal sequences, as for example acid 
phosphatase (Arimaetal., Nuc. Acids Res. 11:1657 [1983]), alpha-factor, alkaline 
phosphatase and invertase may be used to direct secretion from yeast host cells. 
Prokaryotic signal sequences from genes encoding, for example,, LamB or OmpF 
(Wong et al., Gene 68:193 [1988]), MalE, PhoA, or beta-lactamase, as well as other 

i 0 genes, may be used to target proteins from prokaryotic cells into the culture medium. 

The geranyl diphosphate synthase large subunit protein having the sequence 
set forth in SEQ ID N0:2 includes a putative amino terminal membrane insertion 
sequence at residues 1 through 48, and in the embodiment shown in SEQ ID N0:2 
directs the enzyme to plastids. Alternative trafficking sequences from plants, animals 

15 and microbes can be employed in the practice of the invention to direct the gene 
product to the cytoplasm, endoplasmic reticulum, mitochondria or other cellular 
components, or to target the protein for export to the medium. These considerations 
apply to the overexpression of geranyl diphosphate synthase large subunit, and to 
direction of expression within cells or intact organisms to permit gene product 

20 function in any desired location. 

The construction of suitable vectors containing DNA encoding replication 
sequences, regulatory sequences, phenotypic selection genes and the geranyl 
diphosphate synthase large subunit DNA of interest are prepared using standard 
recombinant DNA procedures. Isolated plasmids and DNA fragments are cleaved, 

25 tailored, and ligated together in a specific order to generate the desired vectors, as is 
well known in the art (see, for example, Sambrook et al., supra). 

As discussed above, geranyl diphosphate synthase large subunit variants are 
preferably produced by means of mutation(s) that are generated using the method of 
site-specific mutagenesis. This method requires the synthesis and use of specific 

30 oligonucleotides that encode both the sequence of the desired mutation and a 
sufficient number of adjacent nucleotides to allow the oligonucleotide to stably 
hybridize to the DNA template. 

In another aspect of the invention, a nucleic acid molecule encoding geranyl 
diphosphate synthase large subunit may be introduced into cancerous cells, together 

35 with a nucleic acid molecule encoding geranyl diphosphate synthase small subunit 
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and a nucleic acid molecule encoding a monoterpene synthase that produces a 
monoterpene having anti-cancer properties. Nucleic acid molecules encoding 
geranyl diphosphate synthase large subunit and small subunit (or a single nucleic 
acid molecule encoding both large and small subunits) must be introduced into 
5 cancerous cells, in addition to a gene encoding a monoterpene synthase producing a 
monoterpene having anti-cancer properties, because animal cells do not naturally 
produce geranyl diphosphate which is the chemical precursor to the monoterpenes. 
Examples of monoterpenes having anti-cancer properties are limonene, perillyl 
alcohol and geraniol, as discussed supra. Examples of nucleic acid sequences that 

10 encode monoterpene synthases are disclosed in the following, copending patent 
applications, each of which is incorporated herein by reference: U.S. Patent 
Application Serial Number 08/846,526 "DNA Encoding Limonene Synthase from 
Mentha spicata"; U.S. Patent Application Serial Number 08/937,540 "Monoterpene 
Synthases from Common Sage {Salvia officinalis) and PCX Patent Application Serial 

1 5 Number PCT/US98/14528 "Monoterpene Synthases from Grand fir (Abies grandis)." 

Several methods are known in the art for the introduction of genes into human 
cells. For example, cell-based therapy can be used to introduce genes into cells while 
they are outside of the body. Cell-based approaches involve removing cells from a 
patient, introducing genes encoding a therapeutic protein into the removed cells, and 

20 returning the cells to the patient by cell transplantation or transfusion. The cell-based 
approach has been used to treat Severe Combined Immune Deficiency (SCID), which 
is due to inherited defects in the enzyme adenosine deaminase (ADA). The gene 
therapy treatment of SCID involved removal of peripheral blood lymphocytes or 
bone marrow progenitor cells from affected individuals, introduction of the normal 

25 ADA gene into the chromosomes of these cells using retroviral vectors, and 
reintroduction of the genetically engineered cells to the patient (C. Bordignon et al. 
Science 270:470; 474 (1995), R.M. Blaese et al.. Science 270:475-479 (1995); 
D.B. Kohn et al.. Nature Med. 1:1017-1023 (1995)). Initial results demonstrated that 
the genetically engineered cells will persist for prolonged periods of time, and that 

30 low level expression of ADA can be established. 

Analogous cell-based approaches have been used to treat familial 
hypercholesterolemia (LDL-receptor deficiency) (M. Grossman et al.. Nature 
Genetics 6:335 41 (1994); M.Grossman et al.. Nature Med. 1:1148-1154 (1995)) 
and Gaucher disease (J.A. Nolta etal., J.Clin. Invest. 90:342-348 (1992); 
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L. Xuetal., Exptl. Hematol. 22:223-230 (1994); T. Ohashi etal., Proc. Natl. Acad. 
Sci. 89:11332-11336 (1992)). 

Genes can be introduced into cells in situ, or after removal of the cells from 
the body, by means of viral vectors. For example, retroviruses are RNA viruses that 
5 have the ability to insert their genes into host cell chromosomes after infection. 
Retroviral vectors have been developed that lack the genes encoding viral proteins, 
but retain the ability to infect cells and insert their genes into the chromosomes of the 
target ceil (A.D. Miller, Hum. Gen. Ther. 1:5-14 (1990)). Retroviruses will only 
efficiently infect dividing cells, thus when retroviruses are used to introduce genes 

10 into cells that have been removed from the body, cell division is stimulated with 
growth-promoting media or specific factors. In vivo application of retroviruses has 
been achieved by administration of virus-producing cells directly into tumors. Virus 
particle released by the infected cell will infect adjacent tumor cells, hence only a 
relatively small percentage of cells in a tumor need be initially infected in order to 

15 ultimately introduce the targeted gene into most or all of the tumor cells. (K.W. 
Culver et al., 5c/e«ce 256:1550-1552 (1992)). 

Adenoviral vectors are designed to be administered directly to patients. 
Unlike retroviral vectors, adenoviral vectors do not integrate into the chromosome of 
the host cell. Instead, genes introduced into cells using adenoviral vectors are 

20 maintained in the nucleus as an extrachromosomal element (episome) that persists for 
a limted time period. Adenoviral vectors will infect dividing and non-dividing cells 
in many different tissues in vivo including airway epithelial cells, endothelial cells, 
hepatocytes and various tumors (B.C. Trapnell, Adv Drug Del Rev. 12:185-199 
(1993)). 

25 Another viral vector is the herpes simplex virus, a large, double-stranded 

DNA virus that has been used in some initial applications to deliver therapeutic genes 
to neurons and could potentially be used to deliver therapeutic genes to some forms 
of brain cancer (D.S. Latchman, Mol. Biotechnol. 2:179-95 (1994)). Recombinant 
forms of the vaccinia virus can accommodate large inserts and are generated by 

30 homologous recombination. To date, this vector has been used to deliver interleukins 
(ILs), such as human IL-lp and the costimulatory molecules B7-1 and B7-2 
(G.R. Peplinski etal., Ann. Surg. Oncol. 2:151-9 (1995); J.W. Hodge etal.. Cancer 
Res. 54:5552-55 (1994)). 

Another approach to gene therapy involves the direct introduction of DNA 

35 plasmids into patients. (F.D. Ledley, Hum. Gene Ther. 6:1129-1144 (1995)). The 
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piasmid DNA is taken up by cells within the body and can direct expression of 
recombinant proteins. Typically plasmid DNA is delivered to cells in the form of 
liposomes in which the DNA is associated with one or more lipids, such as DOTMA 
(l,2,-diolcyloxypropyl-3-trimethyl ammonium bromide) and DOPE 
5 (dioleoylphosphatidylethanolamine). Formulations with DOTMA have been shown 
to provide expression in pulmonary epithelial cells in animal models (KX. Brigham 
et al, Am. J. Med. Sci, 298:278-281 (1989); A.B. Canonico et al.. Am. J. Respir. Cell. 
Mol. Biol. 10:24-29 (1994)). Additionally, studies have demonstrated that 
intramuscular injection of plasmid DNA formulated with 5% PVP (50,000 kDa) 

10 increases the level of reporter gene expression in muscle as much as 200-fold over 
the levels found with injection of DNA in saline alone (R.J. Mumper et al., Pharm. 
Res. 13:701-709 (1996); R.J. Mumper et al., Proc. Intern. Syinp. Cont. Rol. Bioac. 
Mater. 22:325-326 (1995)). Intramuscular administration of plasmid DNA results in 
gene expression that lasts for many months (J.A. Wolff et al. Hum. Mol. Genet. 

15 1:363-369 (1992); M. Manthorpe et al.. Hum. Gene Ther. 4:419-431 (1993); 
G. Ascadi etal., A^ew Biol. 3:71-81 (1991), D. Gal et al.. Lab. Invest. 68:18-25 
(1993)). 

Additionally, uptake and expression of DNA has also been observed after 
direct injection of plasmid into the thyroid (M. Sikes et al., Hum. Gene Ther. 

20 5:837-844 (1994)) and synovium (J. Yovandich et al.. Hum. Gene Ther. 6:603-610 
' (1995)). Lower levels of gene expression have been observed after interstitial 
injection into liver (M.A.Hickman et al. Hum. Gene Ther. 5:1477-1483 (1994)), 
skin (E. Raz et al., Proc. Natl Acad. Sci. 91:9519-9523 (1994)), instillation into the 
airways (K.B. Meyer etal.. Gene Therapy 2:450-460 (1995)), application to the 

25 endothehum (G.D. Chapman etal.. Circulation Res. 71:27-33 (1992); R. Riessen 
et al.. Human Gene Therapy, 4:749-758 (1993)), and after intravenous administration 
(R.M. Conry et al.. Cancer Res. 54:1164-1 168 (1994)). 

Various devices have been developed for enhancing the availability of DNA 
to the target cell. A simple approach is to contact the target cell physically with 

30 catheters or implantable materials containing DNA (G.D. Chapman et al.. 
Circulation Res. 71:27-33 (1992)). Another approach is to utilize needle-free, jet 
injection devices which project a column of liquid directly into the target tissue under 
high pressure. (P.A. Furth et al., ^na/ Biochem. 20:365-368 (1992); (H.L. Vahlsing 
etal., J. Immunol. Meth. 175:11-22 (1994); (F.D. Ledley etal.. Cell Biochem. 

35 18A:226 (1994)). 
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Another device for gene delivery is the "gene gun" or Biolistic"''"'^, a ballistic 
device that projects DNA-coated micro-particles directly into the nucleus of cells 
in vivo. Once within the nucleus, the DNA dissolves from the gold or tungsten 
microparticle and can be expressed by the target cell. This method has been used 
5 effectively to transfer genes directly into the skin, liver and muscle (N.S. Yang et ai., 
Proc. Natl. Acad. Sci. 87:9568-9572 (1990); L. Cheng et al., Proc. Natl. Acad. Sci. 
USA. 90:4455-4459 (1993); R.S. Williams et al, Proc. Natl. Acad. Sci. 
88:2726-2730(1991)). 

Another approach to targeted gene delivery is the use of molecular 

1 0 conjugates, which consist of protein or synthetic iigands to which a nucleic acid- or 
DNA-binding agent has been attached for the specific targeting of nucleic acids to 
cells (R.J. Cristiano et al., Proc. Natl. Acad. Sci. USA 90:11548-52 (1993); B.A. 
Bunnell et al., Somat. Call Mol. Genet. 18:559-69 (1992); M. Gotten et al., Proc. 
Natl. Acad Sci. USA 89:6094-98 (1992)). Once the DNA is coupled to the 

15 molecular conjugate, a protein-DNA complex results. This gene delivery system has 
been shown to be capable of targeted delivery to many cell types through the use of 
different hgands (R.J. Cristiano et al., Proc. Natl. Acad Sci. USA 90:11548-52 
(1993)). For example, the vitamin folate has been used as a ligand to promote 
delivery of plasmid DNA into cells that overexpress the folate receptor (e.g., ovarian 

20 carcinoma cells) (S. Gottschalk etal.. Gene Ther. 1:185-91 (1994)). The malaria 
circumsporozoite protein has been used for the liver-specific delivery of genes under 
conditions in which ASOR receptor expression on hepatocytes is low, such as in 
cirrhosis, diabetes, and hepatocellular carcinoma (Z. Ding et al., J. Biol. Chem. 
270:3667-76 (1995)). The overexpression of receptors for epidermal growth factor 

25 (EOF) on cancer cells has allowed for specific uptake of EGF/DNA complexes by 
lung cancer cells (R. Cristiano et al.. Cancer Gene Ther. 3:4-10 (1996)). 

Targeted expression of genes encoding proteins having anti-cancer activity 
can be achieved by placing the transgene under the control of an inducible promoter. 
For example, the promoter for the carcinoembryonic antigen (CEA) gene has been 

30 incorporated in vectors and it has directed cell-specific expression of the resulting 
CEA-expression vector constructs in tumors cells, such as those of pancreatic 
carcinoma (J.M. DiMaio etal.. Surgery 116:205-13 (1994)). The regulatory 
sequences of the human surfactant protein A gene have been used to generate cell- 
specific expression in non-small-cell lung cancers that express this protein (M.J. 

35 Smith et al.. Hum. Gene Ther. 5:29-35 (1994)). 
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Another approach to introducing geranyl diphosphate synthase protein (large 
and small subunits), and monoterpene synthase protein, into a cancerous cell is to 
directly introduce the purified protein into the body. Typically, the protein is 
introduced in association with another molecule, such as a lipid, to protect the protein 
5 from enzymatic degradation. For example, the covalent attachment of polymers, 
especially polyethylene glycol (PEG), has been used to protect certain proteins from 
enzymatic hydrolysis in the body and thus prolong half-life (F. Fuertges, et al., 
J. Controlled Release, 11:139 (1990)). Many polymer systems have been reported 
for protein delivery (Y.H. Bae, et al., J. Controlled Release, 9:271 (1989); R. Hori, 
10 et al., Pharm. Res., 6:813 (1989); I. Yamakawa, et al, J. Pharm. Sci., 79:505 (1990); 
I.Yoshihiro, et al., J. Controlled Release, 10:195 (1989); M. Asano, et al., J. 
Controlled Release, 9:1 11 (1989); J. Rosenblatt et al., J. Controlled Release, 9:195 

(1989) ; K. Makino, J. Controlled Release, 12:235 (1990); Y. Takakura et al., 
J. Pharm. ScL, 78:117 (1989); Y. Takakura et al., J. Pharm. Sci., 78:219 (1989)). 

15 Therapeutic proteins can be introduced into the body by application to a 

bodily membrane capable of absorbing the protein, for example the nasal, 
gastrointestinal and rectal membranes. The protein is typically applied to the 
absorptive membrane in conjunction with a permeation enhancer. (V.H.L. Lee, Cril. 
. Rev. Ther. Drug Carrier Syst., 5:69 (1988); V.H.L. Lee, J. Controlled 

20 Release, 13:213 (1990); V.H.L. Lee, Ed., Peptide and Protein Drug Delivery, Marcel 
Dekker, New York (1991); A.G. DeBoer et al., J. Controlled Release, 13:241 

(1990) ). For example, STDHF is a synthetic derivative of fusidic acid, a steroidal 
surfactant that is similar in structure to the bile salts, and has been used as a 
permeation enhancer for nasal delivery. (W.A. Lee, Biopharm. Nov./Dec, 22, 1990). 

25 Additionally, microspheres bearing therapeutic protein can be delivered to the 

body. In one application, a bioadhesive was used to hold microspheres bearing 
protein in place in the nasal passages. When an absorption enliancer was 
incorporated into the microsphere with the protein, bioavailability was increased 
(L. Ilium, etal.. Int. J. Pharm., 63:207 (1990); N.F. Farraj et al., J. Controlled 

30 i^e/ea^e, 13:253 (1990)). 

The foregoing may be more fully understood in connection with the 
following representative examples, in which "Plasmids" are designated by a lower 
case p followed by an alphanumeric designation. The starting plasmids used in this 
invention are either commercially available, publicly available on an unrestricted 

35 basis, or can be constructed from such available plasmids using published 
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procedures. In addition, other equivalent plasmids are known in the art and will be 
apparent to the ordinary artisan. 

"Digestion", "cutting" or "cleaving" of DNA refers to catalytic cleavage of 
the DNA with an enzyme that acts only at particular locations in the DNA. These 
5 enzymes are called restriction endonucleases, and the site along the DNA sequence 
where each enzyme cleaves is called a restriction site. The restriction enzymes used 
in this invention are commercially available and are used according to the 
instructions supphed by the manufacturers. (See also sections 1.60-1.61 and 
sections 3.38-3.39 of Sambrook et al., supra.) 

10 "Recovery" or "isolation" of a given fragment of DNA from a restriction 

digest means separation of the resulting DNA fragment on a polyacrylamide or an 
agarose gel by electrophoresis, identification of the fragment of interest by 
comparison of its mobility versus that of marker DNA fragments of known molecular 
weight, removal of the gel section containing the desired fragment, and separation of 

1 5 the gel from DNA. This procedure is known generally. For example, see Lawn et al. 
{Nucleic Acids Res. 9:6103-6114 (1982)), and Goeddel et al. {Nucleic Acids Res., 
supra). 

The following examples merely illustrate the best mode now contemplated 
for practicing the invention, but should not be construed to limit the invention. 
20 EXAMPLES 

Example 1 

Isolation of Geranyl Diphosphate Synthase Large Subunit 
Plant materials, substrates and reagents. Mint plants {Mentha spicata and 
M. X piperita) were propagated and grown as previously described (W. Alonso et al., 
25 J. Biol Chem. 267:7582-7587, 1992). Newly emerged, rapidly expanding leaves (5- 
1 0 mm long) of vegetative stems (3-7 weeks-old) were used for the preparation of 
glandular trichome cells for enzyme purification (J. Gershenzon et al.. Anal. 
Biochem. 200:130-138, 1992). [4-l4C]Isopentenyl diphosphate (54 Ci/mol) was 
purchased from DuPont/NEN. Dimethylallyl diphosphate was synthesized as 
30 described (V.J. Davisson et al.. Methods Enzymol. 110:130-144, 1985), as was 
geranyl diphosphate (R. Croteau et al., Arch. Biochem. Biophys. 309:184-192, 1994) 
and famesyl diphosphate (V.M. Dixit et al., J. Org. Chem. 46:1967-1969, 1981). 

Assay for prenyltransf erase activity. To 1 0 jal of enzyme solution was added 
70 \A Mopso buffer (25 mM, pH 7.0) containing 10% glycerol, 10 mM MgCl2, and 1 
35 mM DTT. DMAPP (50 !aM).and [4-14c]IPP (7 )x.M) were added (100 total 
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volume) to initiate the reaction, and the contents were overlaid with 1 ml hexane. 
The mixture was vortexed briefly and then incubated for 1 h at 31°C. After 
incubation, 10 i^l of 3 N HCl was added, the contents vortexed and centrifuged, and 
hydrolysis of the products was continued for 20 min at 31°C. After acid hydrolysis 
5 was complete, the reaction mixture was again vortexed and centrifuged so that the 
products derived from the acid labile allylic diphosphates (or those alcohols derived 
from hydrolysis by endogenotis phosphatases) were partitioned into the hexane layer. 
The hexane was removed and the radioactive products contained therein were 
measured by liquid scintillation counting of an aliquot. For the assay based on 

1 0 enzymatic, rather than acid, hydrolysis, the diphosphate ester products and remaining 
substrates of the incubation mixture were hydrolyzed by treatment with 1 unit each (2 
mg) of wheat germ alkaline phosphatase and potato apyrase, added to each assay in a 
volume of 1 ml of 200 mM Tris buffer (pH 9.5), and allowed to incubate for at least 
8 h at 30°C. The organic extract was then isolated for analysis as before. 

15 Product identification. For the identification of reaction products, 50 p-l of 

the enzyme preparation was diluted into 130 jj.1 of Mopso buffer (25 mM, pH 7.0) 
containing 10% glycerol, 10 mM MgCl2, and 1 mM DTT. DMAPP, GPP or FPP (35 
j^M) and [4-14C]IPP (35 |j.M) were added to initiate the reaction for GPP synthase, 
FPP synthase and GGPP synthase, respectively, and pentane was substituted for the 

20 hexane overlay to improve recovery. After acid or enzymatic hydrolysis and removal 
of the pentane layer as described above, the reaction mixture was extracted with 2 x 1 
ml of diethyl ether to ensure complete recovery of products. The combined organic 
extract was then dried over anhydrous Na2S04 and concentrated to 100 p.1, followed 
by the addition of internal standards and further concentration to 20 jj,l for radio-GLC 

25 analysis. The products sought were: from geranylgeranyl diphosphate, all trans- 
geranylgeraniol from enzyme (phosphatase)-catalyzed hydrolysis in addition to 
geranylnerol and geranyllinalool from acid catalyzed rearrangement (total C20 
alcohols); from farnesyl diphosphate, all rram-farnesol from phosphatase -catalyzed 
or acid hydrolysis, and cis,trans-iQ.m&so\ and nerolidol from acid-catalyzed 

30 rearrangement (total C15 alcohols); from geranyl diphosphate, geraniol from 
phosphatase-catalyzed or acid hydrolysis, and nerol and linalool from acid-catalyzed 
rearrangement (total C\q alcohols); and total C5 alcohols (dimethylallyl alcohol, 
isopentenol and dimethylvinyl carbinol). 

Preparation of mint glandular trichome extracts. Glandular trichome cell 

35 clusters (approximately 2x 10"^) were isolated from 40 g of leaf tissue following 
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procedures previously described (J. Gershenzon et ai., Anal. Biochem. 200:130-138, 
1992). The isolated cell clusters were suspended in potassium phosphate buffer (50 
ml, 100 mM, pH 7.4, containing 5 g XAD, 0.5 g PVPP, 250 mM sucrose, 1 mM 
DTT, 1 mM Benzamidine and 1 mM Na4EDTA) and disrupted by sonication (Braun- 
5 sonic 2000, full power, five 15 s bursts separated by 45 s cooling in ice). The 
sonicate was filtered through a 20 p-m nylon mesh and the filtrate was brought to 1 00 
ml by the addition of 50 ml potassium phosphate buffer without XAD or PVPP. The 
sonicate was then centrifuged at 12.000g" (30 min), then at 195,000g (90 min), and 
the supernatant was utilized as the enzyme source. 

10 Dye-ligand interaction chromatography. The supernatant (generally 

combined from two gland preparations, -200 ml) was dialyzed (2x, 4°C, 18 h total) 
in MES buffer (4 liters, 25 mM, pH 6.2) containing 10% glycerol, 1 mM DTT, and 
10 mM MgCi2. The dialyzed supernatant was equally divided into 8 (50 ml) 
polypropylene tubes containing 5 ml of DyeMatrex Red A Gel (Amicon) equilibrated 

15 with dialysis buffer in each tube. After 1 h of gentle mixing (Labquake), the contents 
were poured into eight 1.5x12 cm polypropylene columns (Bio-Rad), gravity 
drained, and washed with 4x volumes of dialysis buffer. Geranyl diphosphate 
synthase was then eluted with Hepes buffer (240 ml, 25 mM, pH 7.2) containing 10% 
glycerol, 5 mM potassium phosphate, 1 mM DTT, and 1 mM EDTA. The entire 

20 procedure was performed at 0-4°C. 

Anion exchange chromatography. The elutant from the dye-ligand 
interaction chromatography step was loaded on to an HR 10/10 column containing 
Source 15Q anion-exchange media (Pharmacia Biotech) equilibrated in Hepes buffer 
(25 mM, pH 7.5) containing 10% glycerol, 10 mM MgCl2 and 1 mM DTT. Geranyl 

25 diphosphate synthase was eluted with a discontinuous gradient (0-90 and then 90-370 
mM KCl; total volume 140 ml). Famesyl diphosphate synthase activity eluted at 90 
mM KCl; geranyl diphosphate synthase activity eluted at -200 mM KCl; 
geranylgeranyl diphosphate synthase activity was not detected in any fraction of the 
chromatographic run, including the flowthrough upon sample loading and the final 1 

30 M KCl wash step. This anion-exchange chromatography step afforded the most 
efficient purification of geranyl diphosphate synthase, and the fractions (6 ml) with 
the highest activity were collected and stored at -80°C. 

To identify the geranyl diphosphate synthase protein in the anion-exchange 
chromatography fractions, equal volumes from each fraction containing geranyl 

35 diphosphate synthase activity were individually loaded onto separate lanes of an 
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SDS-PAGE gel, and the proteins contained therein were resolved and silver stained 
to reveal the presence of three prominent proteins, at 28 ± 1 kDa, 3 1 ± 1 kDa and 
37 ± 1 kDa, which best tracked the activity. The protein at 28 ± 1 kDa was initially 
considered to represent the geranyl diphosphate synthase based on staining intensity, 
5 coincidence of protein and activity, and consistency of size of this presumptive 
subunit of an assumed homodimer of -60 kDa. The size of the native enzyme from 
Mentha (70 ± 7 kDa) was initially established by gel permeation chromatography 
(Superdex 75), and all previously reported short-chain prenyitransferases are 
homodimers (K. Ogura and T. Koyama, Chem. Rev. 98:1263-1276, 1998; T. Koyama 

10 and K. Ogura, "Isopentenyl Diphosphate Isomerase and Prenyitransferases," in 
Comprehensive Natural Products Chemistry: Isoprenoids, D.E. Cane, ed.. Vol. 2, 
Elsevier Science, Oxford, 1999, pp. 69-96). 

Preparative SDS-PAGE. The partially purified geranyl diphosphate synthase 
from the anion exchange chromatography step (60 ml) was heated to 95°C for 15 

15 min, cooled, and exhaustively dialyzed in distilled water (2x, 4 liters, 18 h, 4°C). 
The protein solution was then lyophilized, and the dried powder was suspended in 
100 [x\ of SDS buffer plus 50 ^1 of 3x loading buffer and separated by SDS-PAGE on 
12.5% acrylamide at 35 mA for 6 h [15 cm x 18 cm x 1.5 mm gel]) by standard 
protocols (U.K. Laemmli, Nature 227:680-685, 1970). Coomassie Blue staining 

20 revealed at least ten protein bands, with prominent species corresponding to 28 ± 1, 
31 ± 1 and 37 ± 1 kDa that were estimated at -10 jig protein based upon staining 
intensity calibrated with carbonic anhydrase as reference. All ten protein bands, 
including the 28 and 37 kDa gef bands that were the most coincident with geranyl 
diphosphate synthase activity on anion-exchange chromatography, were excised from 

25 the gel and stored in microcentrifuge tubes. 

Protein sequencing. The excised gel bands containing the ten separated 
proteins were individually digested with trypsin (Promega V511/l,2) following 
published protocols (J.E. Coligan, "Digestion of Proteins in Gels for Sequence 
Analysis," in Current Protocols in Protein Science, Vol. 1, J.E. Coligan et al., eds., 

30 John Wiley and Sons, New York, 1996, pp. 11.3.1-11.3.13). The resulting peptide 
mixtures, including that derived from the 37 kDa presumptive geranyl diphosphate 
synthase large subunit, were then individually loaded onto a reversed phase HPLC 
(CI 8) column (Brownlee ODS-300), which was equilibrated with distilled water/1% 
TFA (buffer A) and developed by gradient elution with buffer B consisting of 70% 

35 CH3CN, 29% distilled water and 1% TFA (0-60 min, 0%-37% buffer B/ 60-90 min. 
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37%-75% buffer B/ 90-105 min, 75%- 100% buffer B). The purified peptides were 
subjected to amino-terminal sequence analysis via Edman degradation at the 
Washington State University Laboratory for Biotechnology and Bioanalysis. Two of 
the five purified peptides derived from the 28 kDa protein yielded unambiguous 
5 peptide sequences comprising FGLYQGTL (SEQ ID N0:12) and VIIEIS (SEQ ID 
NO: 13), thereby confirming that the 28 kDa protein is the protein characterized as 
geranyl diphosphate synthase in U.S. Patent No. 5,876,094 to Croteau et al.. Four of 
the six purified peptides derived from the 37 kDa protein yielded sequences 
comprising LIGVE (peptide 1) (SEQ ID N0:3), YIAYR (peptide 2) (SEQ ID NO:4), 
10 TAALLTGSVVLGAIL (peptide 3) (SEQ ID N0:5) and EAVETLLHF (peptide 4) 
(SEQ ID NO:6). No other useful amino acid sequence information was obtained 
from the remaining peptides derived from the 28 kDa protein or the 37 kDa protein 
because of ambiguities due to low recoveries of the peptide or the presence of 
contaminants. 

15 Example 2 ■ 

Cloning of Geranyl Diphosphate Synthase Large Subunit cDN As 
Since further scale-up of the geranyl diphosphate synthase large subunit 
purification protocol was impractical, and because the limited peptide sequence 
information, due to degeneracy considerations, precluded a reverse genetic approach 

20 to cDNA cloning, alternate means to acquire a geranyl diphosphate synthase large 
subunit cDNA were attempted. To improve the chances for acquiring this cDNA 
target, methods were developed to isolate mRNA specifically from mint oil glands, 
the exclusive site of monoterpene biosynthesis in Mentha (J. Gershenzon et al.. Anal 
Biochem. 200:130-138, 1992), for the purpose of constructing a highly enriched 

25 cDNA library containing the sequences of interest. 

Glandular trichome cDNA library construction. Available methods of RNA 
isolation and purification, and for secretory cell isolation, are incompatible. The use 
of chaotropic salts or organic solvents as an initial denaturant of ubiquitous RNases 
is not possible because of the long leaf imbibition periods required during the initial 

30 stages of secretory cell isolation. A modified RNA isolation and purification 
protocol was successfully developed which incorporated the use of low molecular 
weight RNase inhibitors in the imbibition medium. Thus, secretory cells were 
isolated from 5-day-old peppermint (J. Gershenzon et al.. Anal. Biochem. 200:130- 
138, 1992) from plants which had been grown as previously described (W.R. Alonso 

35 etal., J. Biol. Chem. 267:7582-7587, 1992) but, in this case using 5 mM 
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aurintricarboxylic acid (R.G. Gonzalez et al.. Biochemistry 19:4299-4303, 1980) and 
1 mM thiourea (E. Van Driessche et al.. Anal. Biochem. 141:184-188, 1984) 
throughout all procedures to prevent enzymatic and nonenzymatic degradation of 
RNA. 

5 Total RNA was then extracted from the isolated secretory cells using the 

method of Logemann (J. Logemann et al.. Anal. Biochem. 163:16-20, 1987) which 
had been modified (S. Lupien et al.. Arch. Biochem. Biophys. 368: 181-192, 1999) to 
include a 10% ethanol precipitation step to remove interfering polysaccharides. 
Poly(A)''"-RNA was purified by chromatography on oligo(dT)-cellulose (Pharmacia), 

10 and 5 p.g of the resulting mRNA was utilized to construct a XZAPII cDNA library 
according to the manufacturer's instructions (Stratagene). 

Homology-based cloning attempts. Concurrent with attempts to purify and 
sequence the native geranyl diphosphate synthase protein, a homology-based PGR 
strategy was devised to isolate the target cDNA, based on the assumption that 

15 geranyl diphosphate (GPP) synthase would resemble in sequence famesyl 
diphosphate (FPP) synthase and/or geranylgeranyl diphosphate (GGPP) synthase. 
After over a year of effort and sequencing of numerous candidate PGR products, the 
only relevant amplicons obtained showed very high homology to famesyl 
diphosphate synthase. Subsequent library screening with these amplicons as labeled 

20 probes led to the acquisition of the corresponding full-length cDNA, that when 
functionally expressed in E. coli confirmed that this cDNA did, in fact, encode FPP 
synthase. 

Random sequencing of an oil gland library. Random cDNA clones from a 
peppermint oil gland cDNA library were sequenced in an effort to identify 

25 prenyltransferase (GPP synthase)-like cDNAs. Plasmids were purified from 
individual colonies arising from a mass excision of mint gland A.ZAPII phagemids 
(Stratagene) and the inserts were sequenced (DyeDeoxy Terminator Gycle 
Sequencing, applied Biosysteras), with the data subsequently acquired on the ABI 
sequenator. The NCBI BLAST server was used for database searching using the 

30 programs of the GGG Wisconsin package (Genetics Gomputer Group, Program 
Manual for the Wisconsin Package, Version 8, Genetics Computer Group, Madison, 
WI, 1994). 

Of the approximately 130 individual clones initially isolated and sequenced, 
one (SEQ ID N0:1) revealed significant homology to geranylgeranyl diphosphate 
35 (GGPP) synthases of plant origin (74-93% similarity; 67-83% identity). Two 
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primers designated GG23F (SEQ ID N0:7) and GG23R (SEQ ID N0:8) were 
designed to amplify a 5'-region (101 bp) of this sequence (SEQ ID NO:9). The 
resulting amplicon was then labeled with [32p]dATP using the same primers (SEQ 
ID NO:7) (SEQ ID NO: 8), and employed as a hybridization probe to screen at high 
5 stringency the oil gland cDNA library. Ten positive clones were purified through a 
second round of screening and were sequenced to yield the full-length cDNA insert 
of pMp23.10 (SEQ ID NOT). In spite of the fact that geranylgeranyl diphosphate 
synthase activity could not be demonstrated in mint oil gland extracts, this clone 
(SEQ ID NOT) was initially considered to encode geranylgeranyl diphosphate 
10 synthase since all plants are required to produce geranylgeranyl diphosphate as an 
essential precursor of chlorophyll, carotenoids, and gibberellin plant growth 
hormones (C.A. West, "Biosynthesis of Diterpenes," in Biosynthesis of Isoprenoid 
Compounds, J.W. Porter and S.L. Spurgeon, eds.. Vol 1, Wiley, New York, NY, 
1981, pp. 375-411). 

15 As disclosed in Example 1 herein, purification of the native geranyl 

diphosphate synthase large subunit from mint yielded a 37 kDa protein upon SDS- 
PAGE. Significantly, alignment of the four peptide sequences derived from this 37 
kDa protein observed in the highly purified preparation, with the deduced amino acid 
sequence of the protein encoded by the insert of pMp23.10 (SEQ ID NOT), revealed 

20 that peptide 1 (LIGVE) (SEQ ID NO:3) corresponded exactly to deduced amino acid 
residues 333 to 337 of SEQ ID N0:2, that peptide 2 (YIAYR) (SEQ ID N0:4) 
corresponded exactly to deduced amino acid residues 371 to 375 of SEQ ID NO:2, 
that peptide 3 (TAALLTGSVVLGAIL) (SEQ ID NO: 5) corresponded to residues 
263 to 277 of SEQ ID NO:2 and peptide 4 (EAVETLLHF) (SEQ ID NO:6) to 

25 residues 349-357 of the pMp23.10 protein (SEQ ID NO:2). This observation cast 
some doubt upon the identification of the pMp23.10 protein (SEQ ID N0:2) as a 
geranylgeranyl diphosphate synthase (for which no activity was observed), and 
prompted further evaluation of this clone. 

Subcloning and heterologous expression. The cDNAs were subcloned into 

30 expression vectors that would allow high levels of bacterial {E. coli) production, as 
well as the ability to co-express both clones in a single bacterial ceil. Plants employ 
different codon usage than E. coli, and the presence of the arginine codons AGA and 
AGG in a sequence can lead to mistranslation or truncation of such eukaryotic 
encoded proteins when heterologously expressed in E. coli. Since most of the 

35 arginines in the sequences of clones Mpl3.18 (SEQ ID N0:1 1) (small subunit) and 
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Mp23.10 (SEQ ID N0:2) (large subunit) are coded for usage by these rare E. coli 
tRNAs, the pET3a/pACYC-derived vector, pSBETa, was used. This vector encodes 
kanamycin resistance, drives expression with T7 DNA polymerase from the strong 
T7 promoter, and additionally carries the argU gene for the tRNA that specifies rare 
5 codon usage to improve translation of such arginine residues (P.M. Schenk et al., 
BioTechniques 19:196-200, 1995). 

The full-length open reading frame of pMpl3.18 (SEQ ID NO: 10) was 
cloned directionally into pSBETa by the addition of an Ndel site at the starting 
methionine by site directed mutagenesis (QuickChange, Stratagene), and the use of a 

10 convenient BarrMl site (8 bp downstream of the stop codon). The vector and the 
engineered derivative of pMpl3.18 (SEQ ID NO: 10), designated pMpl3.18N, were 
doubly-digested with BamRl and Ndel, the fragment purified and ligated overnight, 
and then transformed into E. coli XL 1 -Blue competent cells. The resulting plasmid, 
designated pSB13.18, was purified, sequenced to verify that no undesired changes 

15 occurred during mutagenesis, and then transformed into the T7 expression strain 
E. coli BLR(DE3). Construction of a series of clones in which the plastidial transit 
peptide was truncated at different positions was performed similarly, with the 
incorporation of the Ndel site, and thus the starting methionine, at positions 31, 42, 
48, 50, 55, and 63. The resulting plasmids are designated as pSB13.18M31, 

20 pSB13.18M42, etc., to indicate the position of truncation and of the new starting 
methionine. 

The full-length clone pMp23. 10 (SEQ ID NO: 1), acquired as above, was also 
modified by site directed mutagenesis as above to install both a 5'-NdeI site and a 3'- 
BamHl site beyond the stop codon, thereby creating pMp23.10NB, which was 

25 doubly-digested and ligated into pSBET, and designated pSB23.10. Sequencing 
revealed that no errors were introduced during mutagenesis. For co-expression 
studies, the full-length open reading frame of pMp23.10 (SEQ ID NO:l) and a 
truncation of the leader sequence were subcloned into the ampicillin resistance- 
encoding vector pET32a (Novagen) which had been digested with Ndel and BamHl 

30 and gel purified (yielding essentially pET3a with a T7lac promoter). The full-length 
clone (pMp23.10NB) was similarly double-digested and gel purified, then ligated 
into pET32a to yield pET23.10. The truncation of the plastidial transit peptide was 
created by adding a BamHl overhang downstream of the stop codon and a 5' -Ndel 
overhang (and thus the starting methionine) at residue 83 using sticky-end PGR (K. 

35 Pham et al., Biotechniques 25:206-208, 1998), thereby yielding pET23.10M83. For 
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expression of clone pET23.10 alone, this plasmid was co-transformed with pSBETa 
to take advantage of the ArgU gene of the latter. The above plasmids, as well as 
control pSBET and control pET plasmids (without insert), were transformed into 
E. coli BLR(DE3) for expression. For coexpression, E. coli BLR(DE3) was doubly 
5 transformed with pSB13.18 and pET23.10 (with dual antibiotic selection) to give 
pSB13.18-pET23.10/BLR. 

Each transformant was grown at 37 C in 1 L of LB medium (supplemented 
with 1% glucose) with kanamycin selection (for pSBET) or ampicillin selection (for 
pET), or with dual antibiotic selection (for co-expression using both plasmids), to 
10 A500'= 0-5- The transformed bacteria were then induced with 1 mM IPTG and 
allowed to express for 24 h at 15°C. The bacteria were harvested by centrifugation, 
washed once with Tris buffer (pH 7.0) containing 50 mM KCI, and resuspended in 
25 ml sonication buffer (25 mM Hepes, pH 7.2, 10 mM MgCl2, 10% glycerol, 1 
mM DTT, 1 mM EDTA and 1 mM benzamidine) and disrupted by brief sonication 
15 (VirSonic, 25% power, two 30 s bursts, 0-4°C). The sonicate was centrifuged at 
12,000^ (30 min), then at 195,000g (90 min). The supematant (soluble enzyme 
fraction) was loaded onto an HR 5/5 column containing Source 15Q anion- 
exchange separation medium (Pharmacia Biotech) that had been equilibrated with 
Hepes buffer (25 mM, pH 7.5) containing 10% glycerol, 10 mM MgCl2, 1 mM 
20 DTT and 1 mM benzamidine. A step gradient of KCI (0-85 mM (10 ml); 85 mM 
(15 ml); 85-600 mM (20 ml)) was applied, and 2 ml fractions were collected and 
assayed for prenyltransferase activity using ['4C]IPP and DMAPP (or GPP or FPP) 
as cosubstrates as described above. 

Evaluation of the functional expression of recombinant geranyl diphosphate 
25 synthase, and other prenyltransferases, is compromised by the fact that host cells 
contain competing phosphatases that can hydrolyze both DMAPP and IPP 
cosubstrates, as well as the product of the reaction. Host cells also contain 
endogenous farnesyl diphosphate synthase, capable of converting DMAPP or GPP, 
plus IPP, to FPP; this enzyme is thus capable of depleting GPP formed by a 
30 recombinant synthase. Finally, the co-substrate DMAPP, at high concentration, can 
displace bound GPP as an intermediate of FPP and GGPP syntheses, leading to 
false positive indication of the presence of GPP synthase. For these reasons, partial 
purification of the recombinant proteins by anion-exchange chromatography is 
required to separate competing activities {E. coli farnesyl diphosphate synthase 
35 elutes at ~85 mM KCI; mint recombinant geranyl diphosphate synthase elutes free 
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of competing activities at >90 raM KCl under these conditions), and empty vector 
controls are essential. Furthermore, the assay for recombinant prenyltransferase 
was designed, from studies with the native GPP synthase, to minimize false 
positives due to co-substrate effects and to incorporate appropriate controls (with 
5 boiled enzyme, no co factor, etc.) for monitoring activity. 

Confirmation of identity of geranyl diphosphate synthase clones. All 
constructs harboring clone Mpl3.18 (SEQ ID NO:10) and its truncations 
(pSB13.18/BLR, pSB13.18M31/BLR, pSB13.18M42/BLR, pSB13.18M48/BLR, 
etc.) expressed prenyltransferase activity in crude cell-free extracts at a level that 

10 correspond to that of the pSBET (empty vector) controls, suggesting the presence of 
little or no prenyltransferase activity above endogenous levels present in the host. 
Separation of proteins in these extracts by anion-exchange chromatography revealed 
that essentially all of the prenyltransferase activity eluted at 85 mM KCl, coincident 
with host-derived farnesyl diphosphate synthase. This assignment of the activity to 

15 E. coli farnesyl diphosphate synthase was confirmed by radio-gas chromatographic 
analysis of the C]5 products. Occasionally, very low levels of prenyltransferase 
activity was observed in the recombinant protein extracts, derived from several of the 
constructs, that eluted from the ion-exchange column at >90 mM KCl and this 
activity was confirmed to be geranyl diphosphate synthase by radio-gas 

20 chromatographic analysis of the C[o products. However, expression of this activity 
was always low, and often inconsistent, and in this regard the truncations were no 
better than the full-length preprotein expressed from pSB 13.1 8/BLR. 

The constructs harboring Mp23.10 (SEQ ID NO:l) and its truncation 
(pSB23.10/BLR, pET23.10/BLR and pET23. 10M83/BLR) were also tested by 

25 funcfional expression and, as with clone Mpl3.18 (SEQ ID NO: 10), the expressed 
prenyltransferase activity in crude cell-free extracts of the transformed bacteria 
evidenced no significant difference from the empty vector controls, again suggesting 
the presence of little or no recombinant prenyltransferase activity above endogenous 
levels present in the host. Separation of proteins in these extracts by anion-exchange 

30 chromatography revealed that all of the prenyltransferase activity eluted at 85 mM 
KCL, coincident with host-derived farnesyl diphosphate synthase (confirmed by 
radio-gas chromatography). Thus, clone Mp23.10 (SEQ ID NO:l) did not express 
geranyl diphosphate synthase activity, farnesyl diphosphate synthase activity, or 
geranylgeranyl diphosphate synthase activity. Significantly, based on sequence 

35 comparison, clone Mp23.10 (SEQ ID NO:l) seemed to most closely resemble 
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geranylgeranyl diphosphate synthases of plant origin, but this clone did not express a 
functional homodimeric geranylgeranyl diphosphate synthase, as might be expected 
based on literature precedent (K. Ogura and T. Koyama, Chem. Rev. 98:1263-1276, 
1998; T. Koyama and K. Ogura, in Comprehensive Natural Products Chemistry: 
5 Isoprenoids, D.E. Cane, ed.. Vol. 2, Elsevier Science, Oxford, 1999, pp. 69-96; K. 
Wang and S.-I. Ohnuma, Trends Biochem. Sci., in press). 

Given the observation that the purified, native geranyl diphosphate synthase 
from mint yielded a 28 kDa protein and a 37 kDa protein upon SDS-PAGE, it 
seemed possible that clone MplS.lS (SEQ ID NO: 10) (encoding the 28 kDa protein) 

10 and clone Mp23.10 (SEQ ID N0:1) (encoding the 37 kDa protein) might represent 
the genes encoding the small and large subunits, respectively, of a heterodimeric 
geranyl diphosphate synthase. This suggestion, however, was inconsistent with the 
homodimeric nature of all short-chain prenyltransferases thus far reported (K. Ogura 
and T. Koyama, Chem. Rev. 98:1263-1276, 1998, T. Koyama and K. Ogura, 

15 "Isopentenyl Diphosphate Isomerase and Prenyltransferases," in Comprehensive 
Natural Products Chemistry: Isoprenoids, D.E. Cane, ed.. Vol. 2, Elsevier Science, 
Oxford, 1999, pp. 69-96). Nevertheless, the possibility was examined by co- 
expression of clones Mpl3.18 (SEQ ID NO:10) and Mp23.10 (SEQ ID NO;l) using 
the transformant pSB13.18-pET23.10/BLR. Cell-free extracts of the bacteria 

20 harboring both clones yielded levels of prenyltransferase activity significantly higher 
than the corresponding empty vector controls, and separation of activities by ion- 
exchange chromatography revealed the presence of a prenyltransferase that eluted at 
>90 mM KCL and that was absent in preparations from the controls. This new, 
recombinant prenyltransferase was confirmed to be geranyl diphosphate synthase by 

25 radio-gas chromatographic analysis demonstrating the exclusive production of the 
Cio product. Since the conversion levels of the recombinant preparation approached 
those of the native enzyme under optimized assay conditions, it can be concluded 
that geranyl diphosphate synthase is a functional heterodimer comprised of a small 
subunit, such as that encoded by the cDNA insert of Mpl3.18 (SEQ ID NO:10), and 

30 a large subunit, such as that encoded by the newly isolated cDNA insert of Mp23.10 
(SEQ ID NO:l). 

Size exclusion cliromatography on a calibrated Sephacryl S-100 column 
revealed that the recombinant geranyl diphosphate synthase eluted as a single peak of 
activity at a volume corresponding to a molecular weight of 85,000 ± 8,000, 
35 consistent with the formation of a functional heterodimer of the small and large 
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subunits (i.e., 33.5 kDa plus 40.8 kDa of the respective subunit preproteins). 
Furthermore, SDS-PAGE of this partially purified material revealed the presence of 
essentially equimolar amounts of both subunits by silver staining and by calibrated 
immunoblotting with polyclonal antibodies independently raised in rabbits against 
5 each purified, denatured subunit, thus confirming the heterodimeric nature of this 
geranyl diphosphate synthase. Finally, a search of acquired random clones of the 
mint oil gland library for the large and small geranyl diphosphate synthase subunits, 
and their respective alleles, revealed abundances of 3/1200 (large subunit) and 
4/1200 (small subunit) indicating comparable levels of the corresponding messages 
10 in the original pool and again supporting a 1:1 stoichiometry of the large and small 
subunits. 

Example 3 

Sequence Analysis and Related Considerations 
The geranyl diphosphate synthase small subunit clone (SEQ ID NO: 10) (1131 

15 total nt), previously disclosed and characterized in U.S. Patent Serial No: 5,876,964 
(which patent is expressly incorporated herein by reference in its entirety), encodes 
an open reading frame of 939 nucleotides, corresponding to a preprotein of 313 
amino acids (SEQ ID N0:11) with a calculated molecular weight of 33,465. The 
first 48 deduced amino acid residues show the expected characteristics of an N- 

20 terminal plastidial targeting sequence (i.e., the sequence is rich in serine residues and 
amino acid residues with small, hydrophobic side chains, and is low in acidic 
residues (G. von Heijne et al., Eur. J. Biochem. 180:535-545, 1989)). The presence 
of such an amino-terminal targeting sequence is consistent with the plastidial origin 
of monoterpene biosynthesis in plant cells (M.L. Wise and R. Croteau, "Monoterpene 

25 Biosynthesis," in Comprehensive Natural Products Chemistry: Isoprenoids, D.E. 
Cane, ed.. Vol. 2, Elsevier Science, Oxford, 1999, pp. 97-153), and with the 
localization of this enzyme exclusively within the plastids (E. Soler et al., Planta 
187:171-175, 1992; G. Turner and R. Croteau, unpublished). By excluding the 
putative transit peptide of the preprotein, the amino acid sequence corresponds to a 

30 deduced mature, processed protein of molecular weight 28,485, in full agreement 
with a size of 28 ± 1 kDa determined for this subunit of the native enzyme by SDS- 
PAGE. 

The newly discovered geranyl diphosphate synthase large subunit clone (SEQ 
ID NO:l) (1341 total nt) encodes an open reading frame of 1131 nucleotides. 
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corresponding to a preprotein of 377 amino acids (SEQ ID NO: 2) with a calculated 
molecular weight of 40,800. The first 40 deduced amino acid residues show the 
expected characteristics of an N-terminal plastidial targeting sequence (ChloroP 
predictor, web server). By excluding the putative transit peptide in this case, the 
5 sequence corresponds to a deduced mature, processed protein of molecular weight of 
36,400, in full agreement with a size of 37 ± 1 kDa determined for this subunit of the 
native enzyme by SDS-PAGE. Given these considerations, the size of the functional 
heterodimer, following import, proteolytic processing and assembly in the plastids, 
would be predicted to be ~65 kDa (i.e., 28.5 kDa + 36.4 kDa for the processed 

10 forms), which is consistent with a size of 70 ± 7 kDa determined by gel permeation 
chromatography (on Superdex 75) of the native geranyl diphosphate synthase 
isolated from mint oil glands. 

It is notable that the constituent sequences of the geranyl diphosphate (Cjo) 
synthase more closely resemble those of plant-derived geranylgeranyl (C20) 

15 diphosphate synthase than farnesyl (C15) diphosphate synthase. Thus, the small 
subunit exhibits 26-30% identity and 54-56% similarity to GGPP synthase 
preproteins but only 17-18%> identity and 37-42% similarity to FPP synthases. For 
the large subunit, the resemblance is more striking; 65-72% identity and 76-88% 
similarity to GGPP synthase preproteins but only 18-26% identity and 42-48% 

20 similarity to FPP synthases. These observations suggest the evolutionary origin of 
both GPP synthase subunits from GGPP synthase, which is also plastidial, not from 
FPP synthase, which is a cytosolic enzyme. Since it is clear from the expression 
studies that the large subunit, although it resembles GGPP synthase, is not a GGPP 
synthase, it is of interest to note that a GGPP synthase enzyme, or expressed gene, 

25 has not yet been verified to be present in mint oil glands, although it must be 
fimctional elsewhere in mint leaves for essential metabolic purposes. 

Example 4 

Characteristics of Presently Preferred Nucleic Acid Molecules that Encode Geranyl 

Diphosphate Synthase Small Subunit Proteins 
30 Presently preferred nucleic acid molecules that encode a geranyl diphosphate 

synthase small subunit protein useful in the practice of the present invention (for 
example, for coexpression with geranyl diphosphate synthase large subunit protein in 
a host cell) are capable of hybridizing to the nucleic acid sequence set forth in SEQ 
ID NO: 10, or to the complementary sequence of the nucleic acid sequence set forth 
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in SEQ ID NO: 10, under the following stringent hybridization conditions: 
incubation in 5 X SSC at 65°C for 16 hours, followed by washing under the 
following conditions: two washes in 2 X SSC at 18 C to 25 C for twenty minutes per 
wash, followed by one wash in 0.5 X SSC at 55°C for thirty minutes; most 
5 preferably, two washes in 2 X SSC at 18°C to 25°C for fifteen minutes per wash, 
followed by two washes in 0.2 X SSC at 65°C for twenty minutes per wash. 

The ability of presently preferred nucleic acid molecules that encode a 
geranyl diphosphate synthase small subunit protein to hybridize to the nucleic acid 
sequence set forth in SEQ ID NO: 10, or to the complementary sequence of the 
10 nucleic acid sequence set forth in SEQ ID NO: 10, can be determined utilizing the 
technique of hybridizing radiolabelled nucleic acid probes to nucleic acids 
immobilized on nitrocellulose filters or nylon membranes as set forth, for example, at 
pages 9.52 to 9.55 of Molecular Cloning, A Laboratory Manual (2nd edition), J. 
Sambrook, E.F. Fritsch and T. Maniatis eds, the cited pages of which are 
1 5 incorporated herein by reference. 

The presently most preferred nucleic acid molecule encoding a geranyl 
diphosphate synthase small subunit protein is the nucleic acid molecule having the 
nucleic acid sequence set forth in SEQ ID NO: 10. 

Example 5 

20 Characteristics of Presently Preferred Geranyl Diphosphate Synthase Small Subunit 

Proteins 

Presently preferred geranyl diphosphate synthase small subunit proteins 
useful in the practice of the present inyention (for example, for coexpression with 
geranyl diphosphate synthase large subunit protein in a host cell) possess tlie 
25 properties set forth in Table 1 (geranyl diphosphate synthase small subunit is 
functional in the absence of geranyl diphosphate synthase large subunit, but at only 
about 1% of the activity level of the geranyl diphosphate synthase heterodimer). The 
presently most preferred geranyl diphosphate synthase small subunit protein has the 
amino acid sequence set forth in SEQ ID NO:l 1 . 
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Table 1 

Properties of Presently Preferred Geranyl Diphosphate Synthase Small Subunit 
Proteins 



Cofactor requirement: 


Divalent metal ion (usually Mg++ or Mn"^"*", potentially 
re^, Co^^, Zn^^) 


pH optimum: 


from about pH 6.2 to about pH 7.8 


pi: 


acidic, from about pH 4.5 to about pH 6.0 


Kj„ (isopentenyl 
diphosphate): 


<20 |iM 


(dimethylallyl 
diphosphate); 


<50 i^M 


Kj^ (metal ion): 


Mg++ <5 mM; Mn++ <1 mM 


^cat- 


<5/sec 


Architecture: 


Monomers or homodimers, with monomer molecular 
weight from about 30 kD to about 50 kD 


Other properties: 


Most are plastid-directed, operationally soluble, but 
relatively unstable enzymes. Highly specific for 
dimethylallyl diphosphate as allylic cosubstrate and for 
geranyl diphosphate as product (do not elongate beyond 
C|o)- Inhibited by histidine- and arginine-directed 
reagents. 



Example 6 

Characteristics of Presently Preferred Nucleic Acid Molecules that Encode Geranyl 

Diphosphate Synthase Large Subunit Proteins 
Presently preferred nucleic acid molecules that encode a geranyl diphosphate 
synthase large subunit protein useful in the practice of the present invention (for 



example, for coexpression with geranyl diphosphate synthase small subunit protein 
in a host cell) are capable of hybridizing to the nucleic acid sequence set forth in SEQ 
ID NO:l, or to the complementary sequence of the nucleic acid sequence set forth in 
SEQ ID NO:l, under the following stringent hybridization conditions: incubation in 
5 5 X SSC at 65°C for 16 hours, followed by washing under the following conditions: 
two washes in 2 X SSC at 18°C to 25°C for twenty minutes per wash, followed by 
one wash in 1.0 X SSC at 55°C for thirty minutes, more preferably followed by two 
washes in 0.5 X SSC at SS'C for twenty minutes per wash. 

The ability of presently preferred nucleic acid molecules that encode a 

10 geranyl diphosphate synthase large subunit protein to hybridize to the nucleic acid 
sequence set forth in SEQ ID N0:1, or to the complementary sequence of the nucleic 
acid sequence set forth in SEQ ID N0:1, can be determined utilizing the technique of 
hybridizing radiolabelled nucleic acid probes to nucleic acids immobilized on 
nitrocellulose filters or nylon membranes as set forth, for example, at pages 9.52 to 

15 9.55 of Molecular Cloning, A Laboratory Manual (2nd edition), J. Sambrook, E.F. 
Fritsch and T. Maniatis eds, the cited pages of which are incorporated herein by 
reference. 

Example 7 

Characteristics of Presently Preferred Geranyl Diphosphate Synthase Large Subunit 

20 Proteins 

Presently preferred geranyl diphosphate synthase large subunit proteins useful 
in the practice of the present invention (for example, for coexpression with geranyl 
diphosphate synthase small subunit protein in a host cell) are capable of forming a 
functional heterodimer with geranyl diphosphate synthase small subunit protein. The 

25 resulting geranyl diphosphate synthase heterodimer is capable of catalyzing the 
condensation of dimethylallyl diphosphate (DMAPP) and isopentenyl diphosphate 
(IPP) to form geranyl diphosphate, and possesses the properties set forth in Table 2. 



30 



-48- 



Table 2 

Properties of Geranyl Diphosphate Synthase Heterodimers Including Presently 
Preferred Geranyl Diphosphate Synthase Large Subunit 



Cofactor requirement: 


Divalent metal ion (usually Mg++ or Mn+"^, potentially 
Fe++, Co++, Zn++) 


pH optimum: 


from about pH 6.2 to about pH 7.8 


pi: 


acidic, from about pH 4.5 to about pH 6.0 


Kfri (isopentenyl 
diphosphate): 


<20 i^M 


Kj„ (dimethylallyl 
diphosphate): 


<50 \xM 


Km (metal ion): 


Mg++ <5 mM; Mn++ <1 mM 


^cat- 


<5/sec 


Architecture: 


Native heterodimeric protein of 60-100 kD 


Other properties: 


Most are plastid-directed, operationally soluble, but 
relatively unstable enzymes. Highly specific for 
dimethylallyl diphosphate as aliylic cosubstrate and for 
geranyl diphosphate as product (do not elongate beyond 
Cio). Inhibited by histidine- and arginine-directed 
reagents. 



5 



Additionally, presently preferred geranyl diphosphate synthase large subunit 
proteins useful in the practice of the present invention are recognized by antibodies 
raised against the geranyl diphosphate synthase large subunit protein having the 
amino acid sequence disclosed in SEQ ID N0:2. Antibodies can be raised against 
10 geranyl diphosphate synthase large subunit protein by any art-recognized means. 
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Methods for preparing monoclonal and polyclonal antibodies are well known to those 
of ordinary skill in the art and are set forth, for example, in chapters five and six of 
Antibodies A Laboratory Manual, E. Harlow and D. Lane, Cold Spring Harbor 
Laboratory (1988), the cited chapters of which are incorporated herein by reference. 
5 For example, polyclonal antibodies have been successfully raised against the geranyl 
diphosphate synthase large subunit protein having the amino acid sequence disclosed 
in SEQ ID N0:2 by first purifying this protein by anion exchange chromatography 
followed by excision of the Coomassie Blue-stained protein from an SDS-PAGE gel. 
About 1.5 mg of the geranyl diphosphate synthase large subunit protein having the 
10 amino acid sequence disclosed in SEQ ID N0:2 was excised from a Coomassie Blue- 
stained SDS-PAGE gel and used to inject two rabbits (100 pg per injection). 
Antibodies were bled on the 7th and 9th week after injection. 

Example 8 

A PGR Strategy for Cloning Nucleic Acid Molecules Encoding Geranyl Diphosphate 

15 Synthase Large Subunit 

The following PGR strategy can be utilized to clone additional nucleic acid 
molecules (preferably cDNA molecules) of the present invention that encode a 
geranyl diphosphate synthase large subunit protein. The forward primer for the PGR 
reaction has the sequence: AAR CGM ACN AAY CAY ATG (SEQ ID NO: 14) 

20 (corresponding to amino acids Lys' ''^ through Met'^"* of SEQ ID N0:2). The reverse 
primer for the PGR reaction has the sequence: YG RTG NGG RTG RAA RTG (SEQ 
ID NO: 15) (corresponding to amino acids Arg^^' through His^'^ of SEQ ID NO:2). 
A 100 ^il PGR reaction contains: 20mM Tris-HGi (pH8.4), 50mM KCl, 3.5mM 
MgGl2, 250pM of each^dNTP, 0.1 jaM of each primer, 2.5 units of Taq DNA 

25 polymerase, and 1000 to 1,000,000 template molecules (such as cDNA molecules). 
Representative temperature cycling conditions are: 35 cycles, each cycle including 1 
min at 94''C to denature, 1 min at 50 G to anneal, 1 min at 72 C to extend. 

While the preferred embodiment of the invention has been illustrated and 
described, it will be appreciated that various changes can be made therein without 

30 departing from the spirit and scope of the invention. 



